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ON THE FORMS OF THE PREDICATES IN THE THEORY OF 
CONSTRUCTIVE ORDINALS (SECOND PAPER).* 


By S. C. KEENE. 


In the first paper of this title, the author presented a technique for use 
in attempts to reduce inductive definitions to explicit definitions, and claimed 
by means of it to show that the predicates ae O and a <ob of the system S; 
of notation for ordinal numbers are expressible in the respective forms 
(x) and (x)(Hy)S(a,b,z,y) where R and S are primitive 
recursive predicates. To apply the technique, the inductive definition of a 
predicate P is first rewritten as an equivalence, and then a solution of the 
latter for P as an unknown predicate is sought. 

A fallacy occurs in the preliminary step:* the equivalence does not fully 
express the inductive definition, but may admit other solutions for P than the 
predicate defined by the inductive definition. It is thus necessary to supple- 
ment the application of the technique by a proof that the particular solution 
obtained is not “extraneous”. It appears now that this supplementary 
reasoning can be given for the applications of the technique in which the 
proposed solution has only an existential quantifier, but fails in the final 
application (to ae Q in 14) where there is also a universal quantifier. 

In the present paper we shall show that the result claimed in the first 
paper becomes correct, if a variable a for a 1-place number-theoretic function 
is substituted for the number variable z; i.e., ae O and a <9) are expressible 
in the respective forms (a) (Hy)R(a,a,y) and («)(Hy)S(a,b,a,y) where R 
and S are primitive recursive (29 Theorem IT).* Conversely, every predicate 

* Received December 4, 1953. The references to [13] and [15] were added June 3, 
1954. The ideas for the proof of Theorems I and II were obtained in March 1950 while 
the author was on research leave supported by the University of Wisconsin Graduate 
School and held a John Simon Guggenheim Memorial Fellowship. The first paper of 
this title appeared in this JourNAL, vol. 66 (1944), pp. 41-58. 

* This is the error noted in the bibliographical references to the first paper in [10] 
and [11]. 

* A function or predicate with variables for 1-place number-theoretic functions as 
well as for natural numbers we call primitive recursive (as in [12]), if as a function 
or predicate of its number variables it is primitive recursive uniformly in its function 
variables in the sense of [11] p. 234. Like terminology has already been used in the 
case of general recursiveness in [10] pp. 681-682 and in [12]. 
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of the form («)(Fy)R(a,a,y) where R is general recursive is expressible in 
the form é(a) ¢ O where é is primitive recursive (25 Theorem I). It follows 
by the theory of arithmetical predicates and function quantifiers* that ae 0 
and a<ob are not expressible in the dual forms (Ha) (y)R(a,a,y) and 
(Ea) (y)S(a,b,a,y) with recursive R and S, respectively; a fortiori,* they 
are not arithmetical, so in particular the result claimed originally is false. 
Most of the work done in the first paper is utilized in proving the present 
results. We give an example of a primitive recursive linear ordering of the 
natural numbers which is a well-ordering with respect to the arithmetical 
functions but not with respect to all the number-theoretic functions (26 (J) ). 
Classical methods are used in this example and to some extent elsewhere. The 
results hold also for the relative version of the theory of constructive ordinals 
(in 30). 

Some of the numbering and symbolism is continued from the first paper. 
Sections 18 and 19 indicate in detail what can be kept and what must be 
rejected in that paper. Readers not having that paper at hand may begin 
with 20. For their convenience, information from that paper will be repeated 
here as needed ; and they can choose from various sources cited in the footnotes 
for the notations and results of the theory of recursive functions which are 


employed. 


18. The reduction technique. On p. 46 of the first paper, we rewrote 
the inductive definition 1-4 as an equivalence (18), assuming uncritically 
that the latter is equivalent as a definition of P(a) to the former. 
But suppose that the formal deductive system, for which the P(a) of 

1-4 is the arithmetized provability predicate, includes among its symbols D 
and among its rules of inference modus ponens 

A, ADB 

Also suppose the arithmetrization to have been carried out (as is usual) so 
that A(a), B(a,b) and C(a,6,c) are each true only when a is (the Gédel 
number of) a formula. In this case, (18) can have as solution for P(a) 
not only P(a) = {a is provable}, which we intended, but also P(a) = {a is 
a formula}. For each formula A is inferable. from two other formulas, e. ¢., 
A and A25 A, by modus ponens. Hence, when P(a) = {a is a formula}: 
For each a which makes the left side of (18) true, the third disjunctive 


$ [12] Theorem 3. 
#[12] Theorem 5. 
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member (Ex) (Hy) [P(x)& P(y)& C(a, 2, y)] of the right side, and hence the 
right side itself, will be true. Conversely, for each a which makes the right 
side true, and hence makes either A(a), or B(a,x) for some 2, or C(a,2, y) 
for some a and y, true, the left side will be true. 

What is lost in rewriting the inductive definition 1-4 as the equivalence 
(18) is part of the effect of the extremal clause 4. The predicate P(a) 
defined by 1-4 (write it now P,(a)) is that predicate P(a) satisfying (18) 
for which the set 4P(a) of the a’s which make P(a) true is the least. In 
other words, P,(a) is the solution of (18) such that, for any solution P(a) 
of (18), 

(18.1) P.(a) > P(a). 


This implication can be proved by induction, in the form which corresponds 
to the inductive definition 1-4 of Po(a). 
In particular, then (18.1) holds when P(a) is our solution 


(23) P(a) = (Er) R(a, x) 


of (18). But also, by course-of-values induction on 2 (using (13)), we can 
show from the definition (22) of R(a,x) that 


(18. 2) (a)[R(a,x) Po(a) |, 
whence (via (x)(a)[R(a,x7) >P o(a)| and (a)(x)[R(a,x) > Po(a) |) 
(18.3) (Hx) R(a,x) > Po(a). 


So, by (18.1) (applied to (Hx)R(a,x) as the P(a)) and (18.3), our solu- 
tion P(a) = (r)R(a,x) ((23) with (22)) of (18) is in fact Po(a). 

Thus with the help of the supplementary argument given now, we do 
establish that {a is provable} is expressible in the form (Hz)R(a,x) with a 
primitive recursive R. 

Likewise, at each place in the paper where we rewrote an inductive 
definition as an equivalence, we lost part of the effect of the extremal clause. 
But in every case where we solved that equivalence by a predicate of the 
form (Hx)R(a,x) or (Hx)R(a,b,x), we can show by induction on «x that 
our solution is the predicate defined by the inductive definition. 


19. Conclusions from the first paper. In this manner the conclusions 
of the first paper can be upheld, except those of 14 and 15 including Theorem 
1. and (41) in 16.° By 25 Corollary to Theorem I, we see now that Theorem 1 


*On p. 43 omit the stars on (4) and (11) (ef. [11] *86 and *95 pp. 162-163). 
On p. 52 after (V) and (VI) the “(Use (IV).)” is unnecessary. On p. 53 in (30) and 
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and (41) are false. By 25 Theorem I with [12] Theorem 5, likewise 
Theorem IX of [9] Section 15, our “proof” of which depended on Theorem 
1, is false. 


20. The predicates acO and a<ob (recapitulated).° In this paper 
we are writing T(z,2,y) for T,(z,2,y), and ®(z,z) for (=U (py 
Ts (z,%,y)))." 

Let 09 =1, (n+ 1)o—2". By saying that y defines y, recursively as 
a function of noe, we mean that (n){®(y,no) is defined and=y,}. The 
predicates ae O and a<ob of the system S; of notation for ordinal numbers 
(of the constructive first and second number classes) were introduced by the 
following inductive definition : ® 


1€0. O2. If ye O, then and y<o2. O38. If, for each n, 
yneO and Yn <o Yns1, and if y defines y, recursively as a function of no, then 
3°5%eO and, for each n, yn»<03°5%. O4. If re O, ye O, ze 0, and 
<o2, then O5. ae O and a<ob only as required by 01-04. 

We recall the following lemmas (I)-(VIII) from 12. Here C(b) isa 
class of numbers depending on b (defined in 12), such that (as shown in 18) 


(32) ae C(b) = (Ex) V (a,b, x) 


for a certain primitive recursive V. We write aXob for a<obVa=b, 
a>ob for b <oa, ete. 


(1) IfaeO, thena0. (II) Ifa<ob, thenbAl. (III) 
is empty. (IV) If a<ob, then aeO and beO. (V) If a<o2", then 
a@=oy. (VI) If a<o3- 54, then, for some n,aSo Yn, where Yn = No). 
(VII) If beO, then aeC(b)>~a<ob. (VIII) If beO, then a<ob 
—aeC(b). 


(31) read “b=3-5"” for “a=3-5"”. On p. 54 in (35) read “a=3-52” for 
“e=3-5@:” (an adaptation of 14 is used in 27). In order that 16 Theorem 2 give 
the full force of (41)-(44), it should be added in each part that the function also map 
one-to-one the complement of the one set on a subset of the complement of the other; 
this being added, the first part becomes false. Theorem 3 (which should have heen 
indicated as classical) is now included in the much stronger Corollary to Theorem I 
(also classical ) 

®° The reader may wish to follow the modifications outlined in 30 for the relative 
version of the theory as he reads 20-29 giving the absolute version. 

710; or [9] Sections 4, 7, 10; or [11] pp. 225, 279, 281, 288, 327, 330, 340. 

® The theory of constructive ordinals in various versions appears in Church-Kleene 
[4], Church [3], Kleene [8] (cf. Section 3 for ng, Section 5 for 8,) and Turing [20] 
Section 7. 
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We add several further lemmas. Some of these are proved, as were 
preceding ones, by induction in a form which corresponds to the inductive 
definition 01-05. This is illustrated by the proofs of (VII) in 12 and 
(XX) in 22, which are given in full. 


(IX) If aeO, then a=o1. (By induction.) (X) If aeO, then, 
for each number-theoretic function a(n) such that a(0)=—a and 
(n)[a(n) 41— a(n) >oa(n+1)], there is a number k such that 
a(k)=1. (By induction. For example in the part where it is assumed 
that a = with ye O, by (1), a¥1, so a—2(0) >0a(1). By (V), either 
y= a(1) (Case A) or y>oa(1) (Case B). We apply the hypothesis of the 
induction in Case A to B(n) =a(n+1); in Case B (then by (II), y1) 
to B(n) =y if n=0, — a(n) if n50.) 


(XI) For each a, a<oa. (By reductio ad absurdum, from (11), 
(IV), and (X) applied to a(n) =a.) (XII) If a<ob, thenb<oa. (By 
(IV). O4 and (XI).) (XIII) If ce 0, and bSoc, then a<ob or 
a=b or a>ob. (By induction on c.) 


Together 04, (XI) and (XII) give that O is partially ordered by <o. 
Then by (XIII) the members of O which are <o a given member BD are 
linearly ordered, and indeed by (X) with (IT) well-ordered, by <o. Thus 
far we have not had to mention the classical second number class. 

Letting |1|=0, | for yeO, and | 3-54| | y, | for 
3-WeO and y,=—®(y,no), each member 6 of O represents an ordinal | 6 | 
of the first or second number class, in such a manner that whenever a <b 
then «| <|b6| and that for each 2 <|b6| there is a member a of O such 
that -a@' =a and a<ob.® By this with (XIII), the ordering of the set 
i(a<ob) by <o is of the type |b|. The least ordinal >|6| for every 
be O is called o,. 


21. A digression concerning recursive well-orderings. A partial recur- 
sive function enm(b,m) which for each fixed 6 >o91 enumerates the numbers 
<o 6 is constructible by the method used in the earliest version of the theory.’ 
However it is worth noting that the result is immediate from 12 and 13. 
It is easily arranged that enm be primitive recursive and that the enumera- 
tion be without reptitions when 6 represents an infinite ordinal. In detail: 


* Cf. 11, [3], [8], and for an earlier version [4]. Also for (XIII), ef. [3] pp. 
229-230. 


7° [4] p. 20 enm, and [8] p. 154 end Section 3. Also cf. [20] pp. 188-189 Inf. 
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Combining (VII) and (VIII) with (32), 
(21. 1) beO> (a <o =(Hr)V (a, b,x)). 


Let Fin(b) = (En)n2a[b=no], so Fin(b) ={beO by O1 and O02 only} 
=({b represents a finite ordinal}. Let nat(b) =prraa[b—no], so that 
nat(n9) =n. (When Fin(6), nat(d) is as a natural number.) We use 
(x); in the sense of [11].1* Now in the case Fin(b) (whether or not be 0), 
let enm(b,m) be (m)o, if V((m)o,b,(m),) and (m), is not among 


enm(b,0),- - -,enm(b,m—1), and otherwise let enm(b,m) be the first 
among -,mo which is not among enm(b,0),- - -,enm(b,m—1). In 


the case Fin(b), let enm(b,m) = (nat(b) ~—(m-+1))o.7? This gives a 
course-of-values recursion on m for enm(b, m), so enm is primitive recursive." 


(XIV) If beO and Fin(b), then (n)[no<o6]. (By induction.) 
(XV) There is a primitive recursive function enm(b,m) such that: (i) Jf 
b= no, then enm(b,m) = (n—(m-+1))o. (ii) If be O and Fin(b), then 
enm(b,0), enm(b,1), enm(b,2),- is an enumeration without repetitions 
of a(a<ob). 

In Cantor’s theory, ordinals arise on the one hand as the order types 
of well-ordered sets, and on the other as the objects generated by certain 
principles. In the present theory of constructive ordinals, it is the latter 
version of the theory which is made constructive. We inquire now into the 
relation between this and the result of making the first version constructive, 


i.e., of requiring the well-ordering to be recursive. 


(A) There ts a partial recursive ternary relation m <yn such that, for 
each fixed b: If be O and Fin(b), then the binary relation m<yn is 4 


general recursive well-ordering of the natural numbers of order type | 6 |." 


Proof. Suppose be O and Fin(b). The numbers <9 b are well-ordered by 
<o with the order type | b| (end 20). But enm(d, 0), enm(d, 1), enm(d, 2),- - - 
is an enumeration of them without repetitions. So it will suffice to define 


11 [11] p. 230, viz., (#); = {the number of times contains the i+ 1-st prime 
number as factor} if ~ 0, = 0if 0. The author originally introduced (@),,:+ + ,(2)m 
in [9] (1943) p. 50 for an unspecified m-tuple of functions with certain properties, and 
then in the first paper of this title (1944) 5 defined (#);, so that the (#);,, there is the 
(a); now; but since 1946 he has undertaken to standardize the notation (a); in its 
present meaning (e.g., it was so used at his suggestion in Nelson’s [16] (1947) ). 

12n+m=—n—m if n=>m, = 0 otherwise (cf. [11] p. 223 #6). 

18 Péter [17] Section 1 or [18] Section 3, Kleene [11] Section 46. 

14 A related result is given by Turing [20] p. 183 (vi). 
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m <yn partial recursively so that, for the b in question, m <<,n=enm(b, m) 
<eenm(b,n). To determine whether enm(b,m) <genm(b,n), we first 
check that m4n. If mn, then either enm(b,m) occurs in the enumera- 
happens if enm(b, m) <»enm(b, n) and only then (since enm(b, m) = enm(8, 1) 
= 1isexcluded), or else enm(b, n) occurs in the enumeration enm(enm(b, m), 0), 
enm(enm(b, m), 1),enm(enm(b, m),2),- - -, which happens if and only if 
enm(b,m) >oenm(b,n). So we take 


m<pn=mAn&Kenm(enm(b,n), 7) —enm(b, m) 


where 
T = pt[enm(enm(), n), t) = enm(b, m) V enm(enm(b, m), ¢) = enm(b, n) |. 


(B) To any general recursive well-ordering of the natural numbers, 
there is a primitive recursive well-ordering of the natural numbers with the 


same order type.*® 


Proof. The construction which follows will lead from a given general 
recursive well-ordering < of order type x to a primitive recursive one <’ of 
typew+a. But and for a primitive recursive 
well-ordering of type « is easily defined. 

Let e be a Godel number of (or define recursively) the general recursive 
ordering predicate m< n; so and 


T.(e,m,n, x) > [m< n=U (zr) = 


For any natural number n, let the image of n be 2”- 3maXmsnmaTilemnz), Tf ] is 
the image of some number (in symbols, Im(/)), that number is (1)o. In the 
new well-ordering <’, the non-images in order of magnitude among them- 
selves shall precede the images ordered among themselves as the numbers of 
which they are images were ordered in the original ordering <. Since the 
class of the non-images is infinite, the new ordering <’ is of order type 
o+a. The following formulas show that the new ordering <’ is primitive 
recursive. 


Tm (1) = 20-301 & (Hm) m, (1) 0, (1)1) 
& (mM) m, (1) 0, 2), 


18 By [11] pp. 285-287 Example 1, ‘ primitive recursive’ can be strengthened here 
to ‘elementary’ in the sense of P. Csillag and L. Kalmar (cf. [6], [18] Section 8, 
or [11] p. 285). 

16 [9] pp. 52-53 Theorem IV, [11] p. 288 Theorem IX, or (briefly) 10. “(E!a2)” 
means “ there exists a unique ¢ such that ”. 


= 
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{Im(k) & Im(1) && <1} V {Im(k) & Im(1)} 


V {Im(k) & Im(1) & {[(k)o < (1) 0 &U 05 (1) 0,2)) =0] 


> (1) o& (1) 0, (k)o, %)) A0]}}. 


(C) There is a primitive recursive ternary relation m <p’ n such that, 
for each fixed b: If be O and Fin(b), then the binary relation m <)'n ts a 
primitive recursive well-ordering of the natural numbers of order type |b|. 


Proof. We apply the method of proof of (B) to the m <,n of (A) as 
the m< n. But now we can distinguish primitive recursively which elements 
in the given ordering are infinite; namely {n is infinite} = Fin(enm(b,n)). 
So by letting all numbers not images of infinite elements precede the images 
only of infinite elements, we avoid the increase for a <w? from « to w+ @. 
As a Godel number e of the ordering m <,n, we have S,1(d,b),'7 where d is 


a Godel number of m <yn as predicate of b, m,n. 


By (A), the order types of general (and hence by (B), or directly by 
(C), of primitive) recursive well-orderings of the natural numbers include 
That they include only such ordinals was 
shown by Markwald [15] (and independently by Clifford Spector, from 
whom the author first learned the result in February 1954 shortly before [15] 


all transfinite ordinals < «. 


was received in Madison). 


22. Sums of ordinal notations. 


was treated briefly by Church and Kleene.** We redo the treatment, in order 
to secure an additional property of the sum function which we shall need 


now ((XX) below). 


We seek a recursive function a+ 90 such that 


aif b=1&a-~0 


atob= 


3° if b= 3- 


otherwise 


where (in Case 3) dq, defines recursively a+9(y,no) as a function of ng. 


We obtain such a function a+ 90 as follows. 


(Case 1), 


if b= 24” where y 540 


(Case 


(Case 3), 


(Case 4), 


Let 


6(2,4,y,n) = n)).7 


17 End 10, or [8] p. 153, or [11] p. 342 Theorem XXITI.** 


18 [4] p. 18, [8] p. 154. Also Turing [20] p. 188. 
1° Here, since a+ 9(y,% 9) may be undefined, we 


No)}. 


mean that 


The addition of ordinal notations 


(2) No) 


| 

(22.1) — 

t 
F 

‘ 
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Let P define ( ly Y, n) recursively.?® Let 11:17 


y(2, a,b) = 
V {b = 20 & KO & 
V = 3 - & = 8 - 


V {(b41Va=0) & = 0) &H A+ 5O2& t= 7}]. 


By the recursion theorem,”° there is a number e which defines y(e,a, 6) 
recursively. Let a+ob=vy(e,a,b). This gives a+b as a partial recur- 


sive function, but in fact it is primitive recursive.” 
(XVI) If aeO and beO, then (a) a+obe0 and (b) 
<oa+o5]. 


The proof is by induction on b with a fixed. Parts 1 and 2 (for be O by O1 
and O2, respectively) are straightforward. For 3, assume that 6 =3-5¥ where 
y is as in O3, and as hypothesis of the induction that (XVI) applies to each 
Yn as the b. (a) By Case 3 of (22.1), a+ o9b—3-54 where for each n, 
®(d,no) is defined (call it d,) and d,—a-+oyYn. Applying the hypothesis 
of the induction, we have by (a) (for y, as the b) d,eO and by (b) (for 
Yns aS the c, b) dn<odnu. Hence by 03, 3-5%eO and 54, 
ie, @+obe0O and a+oyn<oa+ob. (b) Assume c<ob. By (VI), for 
some n, either cy, (Case A) or ¢c<oYn (Case B). Case A. a+toc 
=at+toYn<od+ob. Case B. By the hypothesis of the induction (applied 
to ¢, Yn as the c,b),atoc<oatoyn But atoyn<oatodb. By 04, 
atec<oatob. 


(XVII) Jf aeO and b>o1, then a+ob>oa. (Use (IV), (XVI) 
(b), (1), (22.1).) (XVIII) If aeO, then a+olo>ol. (By (22.1), 
(1), O02 and (IX).) (XIX) Jf aeO and beO, then a+obZoa. (Use 
(IX), (XVII), (22.1) and (T).) 


(XX) Jf at+tobeO, then ac O and beO. In fact, for every c which 
(a) Ifatob=c, then acO and beO. (b) If 
then bj (c) If bA1&aZSof <oa+ob=c, then 


(Eg) [9 <ob&f=a-+og]. 


Proof is by induction on c, as follows. 
20 [8] p. 153 the last result of Section 2, or [11] p. 352 Theorem XXVII. 
*1 For upon using the resulting value S,°(p,e,a,y) of dg, in (22.1), we have a 


course-of-values recursion for a +, 
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1. Assume thatc—1. (a) Assume a+o9b—c. Now Cases 3 and 4 of 
(22.1) are excluded. Also Case 2 is excluded, since this could only apply 
if a+oy=0, and in none of the cases of (22.1) (for y as the 6) can 
atoy=0. So Case 1 applies, with a—=b=—1. But 1eO. So a,beO. 
(b) Assume By (a) applied to a+ 5, and a+ bz, 
a=b,=—b.=1. So b,:—b.. (c) By (II), f<oa+ob=ec cannot hold 


when c—1. 


2. Assume that c= 2¢ where de O, and as hypothesis of the induction 
that (XX) applies to d as the c. By 02,d<oc. (a) Assume a+ob=—c. 
Now only Case 1 or Case 2 of (22.1) can apply. In Case 1,a—c and b=1. 
But c,leO. So a,beO. In Case 2, d=a-+oy and b=2”. By (a) of 
the hypothesis of the induction, then a,yeO, so using 02, a,beO (and 
y<ob). (b) Assume a+o0b:=a-+o0b:=c. If Case 1 applies to both 
a+ob, and a+ob., we have If Case 2 applied to a+ ob, but 
Case 1 to a+o)., we would have where y~0 so (and by 
(a) a,b,e0, and by (IX) b;>01), but Thus we would have 
a=c=a+tob,>oa (by (XVII)), contradicting (XI). So this pair of 
cases cannot apply; and similarly vice versa. If Case 2 applies to both a +o d, 
and a+ob., we have b, = 2%, b, = 2” and c= 24 = whence 
d=atoy,—a+oy:. By the hypothesis of the induction, y;—y-. So 
b,=b2. (c) Assume <oa+ob=—c. Since Case 1 
does not apply to a+o0b; and by Case 2, d=a-+oy and b=2", whence 
using (a) yeO and y<ob. Now by (V), either f—d (Case A) or f <od 
(Case B). In Case A, y is a g. In Case B, y¥1, as otherwise we would 
have f <<o9a+o1—a (since by (a) ae 0, so by (1) a0), contradicting 
the hypothesis a Sof (by (XI), (XII)). By the hypothesis of the induction 
there is an h such that h <<oy&f=a-+oh. By O4 (since y <ob), kh <ob. 
Thus h is a g. 


3. Assume that c—3- 5%, where, for each n, @(d,n,) is defined (call 
it dn), dnxe O and dy <odn.1, and as hypothesis of the induction that (XX) 
applies to each d, as the c. By 03,d,<oc. (a) Assume a+ob—c. Only 
Case 1 or 3 of (22.1) can apply. In Case 1, ac and b—1, soa,beO. 
In Case 3, b=3- 54 and, for each n, ®(d,no) =a-+o(y,no). But since 
@(d,no) is defined (—d,), a+o0®(y, no) and hence no) is defined (call 
it Yn),*? so that dn —=a-+oYn. By (a) of the hypothesis of the induction, 
a,yneO. By (XIX), a@Sody. Were we would have d,,, =a So dp, 


72 Under the weak sense ([8] p. 152 Il. 20-22, or [11] p. 327), a+o(y,m9) can 
be defined only when ®(y, 79) is defined. 


—_ eh 


414 
( 
t 
a 
I 
i 
( 
4 
| us 
of 
a 
g 


PREDICATES IN THE THEORY OF CONSTRUCTIVE ORDINALS. 415 


contradicting dy, Thus 1. Also dy <o Using 
the hypothesis of the induction to apply (c) (with dy, dnii, Ynsx as the f,c, b), 
there is a g such that 9 <0 Ynu &dn=a+tog. But d,=a+oyn. Using the 
hypothesis of the induction to apply (b) (with d, as the c), g=Yyn. Hence 
Yn <oYnsi- Now by O3 we can conclude that 3-5%¢O (and for each n, 
Yn<o3°5Y=b). Thus a,beO. (b) Assume It 
Case 1 applies to both a+ ob, and a+ ob., or Case 3 to either and Case 1 
to the other, we can argue as under Part 2. If Case 3 applies to both, then 
b, =3-5% and b. —3- 5 for some numbers y; (= (01)2) and ys (= 
and by the actual construction of the function +o, a +ob, = 3 - 55°(e,ea,(t1)2) 
and = 3 Hence 81°(p, e,a, (b1)2) = S13 (p, e, a, (b2) 2). 
By the construction of the function now (b2)>, 
Yi =Ye2, 80 = (c) Assume b41KaSof <oa+tob=—c. Case 1 
does not apply; and by Case 3, b=3- 5” ete. (cf. under (a)). By (VI), for 
some n, either fd, (Case A) or f <od, (Case B). The proof that there 
is a g is completed in the same manner as in Part 2. 


23. Finite sums. For any number-theoretic function (x), let 


p(x) i, 
(23. 1) 
Xo = Loy (x) +oy(m). 


Then Soy(x) is a function of m primitive recursive in y. Similarly with 
instead of y(z). 


(XXI) If then Sow(x)eO. (By induction on m, 
using (XVI) (a).) Conversely: (XXII) If So € O, then (©) p(x) O}. 
rm 
(Similarly, using (XX).) (XXIII) If O] and >o1, then 
W(x) <o So w(x). (Use (XVII) and (XXI).) 


24. Securability. We can represent a finite sequence a(0),---,a(2—1) 

of numbers (x=0) by the single number p,**1.** There is 

a particular primitive recursive predicate 7,*(v,z,a@) such that, for each 

general recursive predicate R(a,a,x),? there is a number f such that 


*3 [11] p. 342. Erratum (1952 printing): 1. 7 from below, replace “ds+--” by 
“d«[Zexp- + -]”. 

24 As in [10] p. 680 and [12] Footnote 2; p, = {the i + 1-st prime number} ([11] 
p. 230 #18). 
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(Ex) R(a, a,x) = Thus any predicate of the form 
(Ex)R(a,a,x) with R general recursive can be taken to be of the form 
(Ex)R(a,4(x)) with R primitive recursive. 

Define Seq(w) =w & (1) (w)i so Seq(w) is primitive 
recursive and Seq(w) ={w=-y(x) for some y and z}=({w represents a 
finite sequence y(0),---,y(w—1)}. Here y(t) =(w)i—1 -, 
z—1). 

Consider any predicate R(a,v) and any such number w—/7(z). 

We say w (or the z-tuple y(0),- - -,y(e—1) which w represents) is 
secured (with respect to R and a), if (£t)t=2R (a, 7(t) ). 

We say w (or y(0),---,y(w—1)) is immediately secured (with 


respect to R and a), if y(0),- - -,y(w—1) but no proper initial segment 
of it is secured, i.e., if R(a,7(x)) & (t) R(a, ). 
We say w (or y(0),- - -,y(@—1)) is past secured (with respect to R 


and a), if w is secured but not immediately so, i.e., if (Lt) :<2R(a,7(t) ). 

We say w (or y(0),- + -,y(—1)) is securable (with respect to & 
and a), if w is secured or («)(Ht)R(a,wa(t)), ie, if no matter how 
y(0),- (if it is not already secured) is extended as y(0),-- -, 
a sequence 
will eventually result which is secured.*° 

Applying this notion to the case w=1—y(0), («)(Lx)R(a,a(2)) 
= {1 is securable with respect to #& and a}. 

Any number w not of the form 7(x), i.e., such that Seq(w), we take 
to be not secured, immediately secured, past secured or securable. 


Now let “securable,;” express the notion of securability as just defined 


explicitly, and “securable;” the notion we define next by an inductive 
definition S1-S3. For brevity we are omitting the words “with respect to 


R and a”. 


S1. If w is secured, then w is securable. S82. If Seq(w), and w is 
not secured,?* but, for every natural number s, w*2**! is securable, then w is 
securable. S3. w is securable only as required by S1-S2. 


(D) If w ts securable;, then w ts securableg. 


°° From [11] Theorem IV* pp. 281, 292 (with pp. 231, 291, where a(a#) and 
T,'(v,2,a@,@) are defined) by putting 
T,)(v,2,@) = exp (v); + 1,2, a, lh(v)), 
noting that lh(a(#)) =a p. 230 #20 (as in [10], or [12] Footnote 2) .11:12-74 
26 wxa(t) represents the latter sequence; cf. [11] p. 230 #21. 
*7 We could omit “w is not secured” in S2, but we prefer to keep it, so that the 
inductive definition analyzes securability into two exclusive cases Sl and S82. 
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This is easily proved by induction in the form corresponding to the 
inductive definition S1-83. Conversely: 


(E) If w is securableg, then w is securable. 


We establish this classically. Brouwer argues (in substance) that it is 
the case intuitionistically (in [2] Footnote 7). 

Suppose that Seq(w) but w is not securable;. Then by 91, w is not 
secured (so R(a,w*a(0))), and by S82 for some number 5, w*2%o*? is not 
securabley (put so 2%*1=4(1)). Then by S1, w*a&(1) is not 
secured (so R(a,w*&(1))), and by S2 for some number s,, w*a(1)*2**? is 
not securable; (put «(1) —<s,, so Then by 81, w*a(2) 
is not secured (so R(a,w*%(2))), etc. Thus we obtain a function « such 
that (t)R(a,w*a(t)). Hence (a) (Et)R(a,w*a(t)), besides which w is not 
secured, i.e., w is not securabley. 

The remarks in this section apply similarly with a@,,---,a, for any 
fixed n = 0 in place of a (then 7,1 becomes T’,'). 


25. Classification of ace O and a <,b from below. 


THEOREM I. Given any general recursive predicate R(a, a,x), a primitive 
recursive function €(a) can be found such that (a)(Ex)R(a, «, x)= (a) € 0.78 


Proof. By 24, after choosing a primitive recursive R(a,v) so that 
(Lr) R(a, 7) = (Hr) R(a,a(x)), it will suffice to choose é so that é(a)e¢O 
if and only if 1 is securable with respect to this R and a. We begin by 
defining (a, w) so that €(a,w) eO if and only if w is securable with respect 
to R and a. Then we can take (a) —€(a,1). 

We choose the function €(a,w) to satisfy 


(0 if Seq(w) (Case 0), 
25.1)  &(a,w) if w is secured (Case 1), 
3° 5fe~ if Seq(w) but w is not secured (Case 2), 


where (in Case 2) fa defines recursively So (é(a, w#281)+o19) as a 


function of no. 


*8Qur proof of the implication (a) eO— (a) (Ha)R(a,a,2) makes use of the 
classical second number class.—Putting &(a@) = &(@)+odo, & has the properties of é in 
the theorem, and a ~ b > &(a) = &(b); so the decision problem for @(a) (Ha) R(a, a, @) 
is 1-] reducible (Post) to that for O (ef. [19], [11] p- 343 Example 2). 


q 
a 


S. C. KLEENE. 


We obtain such a function é as follows. Let 


6(z,a,w,b) = Yo w#2***) +o1o) 


s<nat(b) 


(cf. 21). Let p define 6 recursively. Let 


it Seq(w), 


25.2) y(z,a,w) =- 1 if Seq(w) & (Et) ram) (a, 


if Seq (w) & (FE) (a, II 


Then y is primitive recursive. Choose e by the recursion theorem so that ¢ 
defines y(e,a,w) recursively.2° Let é(a,w) =y(e,a,w). 

Now we prove by an induction of form corresponding to the inductive 
definition S1-S3 of securability that, if w is securable, then €(a,w) ¢0O. 


1. Assume w is secured. Then by Case 1 of (25.1), &(a,w) =1. 
So by O01, é(a,w) € O. 


2. Suppose Seq(w), w is not secured, but, for every s, w*2**? is securable, 


and as hypothesis of the induction that, for every s, (a, w#2**) eO. Then by 


(XVI) and (XXI), for each n, So (é(a, +010) Moreover by 
s<n 


(XXIII) and (XVIII), for each n, 
DdXo (€(a, w#2**?) +010) <o (€(a, w#28**) + glo). 
s<n 


Hence by (25.1) Case 2 with 03, é(a,w) €0O. 


Conversely, we prove that, if €(a,w) —c and ceO, then w is securable. 
We give the proof by transfinite induction on |¢|.2° The cases in the proof 
correspond to the clauses 01-03 by which c can &€0O. 


1. Assume c—1. Then é(a,w) 1; it is Case 1 of (25.1) which 
applies to é(a,w) ; and so w is secured, a fortiori securable. 


2. Assume c= 24 where ye O, so by (1), yO. But then é(a,w) —c 


is impossible under (25.1). 


*° Several attempts to get by with induction on ¢ instead of on | ¢| were unsuccess- 
ful. A difficulty is that a + ,b is not necessarily >) b. Indeed there is no function o 
such that, for all a,b2e0O, both aob =>oa and aob=ob. For if e and f define recur- 
sively and (n+ 1) 9, respectively, as functions of ng, and a= 3- 5° and = 3: 5/, 
then |a|=|6|=wo, but ab; so by the linearity of the ordering by <o of the 
members of O which are <9, a given one and the consistency of this ordering with that 
of the ordinals represented (end 20), a and } cannot both occur among the numbers 
<o 2°°”. Another difficulty is that the associative law (a+,b6)+ = a+ o(b+oc) 
wnen a— 1,6 2, ¢ = 3-5’. 
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3. Assume c=3- 5” where, for each n, ®(y, no) is defined (call it yn), 
yneO and Yn<oYnu (and by 03, yn<oc). Now it is Case 2 of (25.1) 
which applies to (a,w). Thus Seq(w), but w is not secured; moreover 
(as y=faw), for each n, Yn = Zo (€(a, +010), so 

DdXo (E(a, +010) and (é(a, +010) 
s<n 
Using (XXII) and (XX), for every s, €(a, w*2*') eO. Furthermore, since 
(taking n above +1) 


¢>o Do (€(a, w#2™**) +ol1o) 


m<stl 
= Do (é(a, w#2”**) +olo) +o (E(a, w#28*") +o1o). 
m<s 
| €(a, | < | E(a, +olo| S| Yo (E(a, w#2™"") +olo| < jc]. 


m<stl 
So by the hypothesis of the transfinite induction, since €(a, w#2#*?) e O, w#28* 
is securable. This is for every s, and Seq(w), so w is securable. 


Corotuary. The predicates ae O and a<ob are not expressible in the 
forms (Ea) (x)R(a,a,x) and (Ha) (x)S(a,b,a,2) with general recursive 
R and 8, respectively; a fortiori, they are not arithmetical. 


Proof. Were aeO= (Ka) (x)R(a,a,2) with R recursive, by applying 
the theorem to 7,*(a,a,x) as its R(a,a,x), we would have 


(a)(Hx)T ,*(a, a, ©) = (H2)(x)R(E(a), a, = (F2)(x)R,(a, 2) 


with recursive #,, which is impossible.*° The statement for a<ob follows, 
since ae O=a <o 2% by O2 and (IV). 


26. A digression concerning recursive linear orderings. Let us examine 
further the construction used in the proof of Theorem I. 

We use > to denote the linear ordering of the set #Seq(w) of the 
numbers w of the form &(x) which is established by ordering the finite 
sequences -,a(«—1) lexicographically with the infinite alphabet 

-,2,1,0. In this ordering, 1 (which represents the empty sequence) is 
the highest element. If Seq(w) or Seq(v), wu >v shall be false. Then the 
predicate wu >v is primitive recursive. 

Now, for any given predicate R(a,v) and numbers a and w, we define 
a set S%4,,. or briefly S*, or S., as follows. If w is secured (with respect to 
R and a), S,, shall be the unit set {w}. If Seq(w) and w is unsecured, S,, 


°° [12] Theorem 3 Proof for » >0. Also cf. [12] Theorem 5. 


; 
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shall consist of all w*é(t) which are unsecured or immediately secured. 
If Seq(w), Sw shall be the empty set. Then we S*,, as predicate of u.w,a 
is primitive recursive uniformly in &. 

If weSy», then Seq(u). So if Seq(w), 9, is linearly ordered by > 
with w is highest element. 


(F) The set Sy is well-ordered by >, if w is securable, in which case 
the ordinal number of Sy is | é(a,w)| + 1.3 


We prove this, as follows, by an induction of form corresponding to the 


inductive definition of securability. 

1. Assume w is secured. Then S, is {w}, which is well-ordered with 
ordinal number 1. But by Case 1 of (25.1), €(a,w)=1—0o. So 
|E(a,w)| +1=0+4+1—1. 

2. Assume Seq(w), w is unsecured, but w*2** is securable (s = 0, 1, 2.- - -), 
and as hypothesis of the induction that S,,,2.»1 is well-ordered by > with 
ordinal number | €(a, w*2#*t)|-+1. Then under the ordering >, S,, is the 
sum of {w},- + +, Sweex, Sweo2, Sweo1, and so is well-ordered with the ordinal 


number > (| €&(a,w*2*4)|+1)+1. But by Case 2 of (25.1). 


$=0,1,2, 


= (| €(a, +1) é(a,w)| +1. 


8=0,1,2,--- 


(G) If Seq(w), and 8, 1s well-ordered by >, then w is securable. 


For we can show by transfinite induction on the ordinal number of 8, 
that, if Seq(w), and 8, is well-ordered, then each member w*a(t) of S,, 
is securable; so in particular, then w (—w*a(0)) is securable. 

Now we return to (F), and give another argument for the part not 
concerning €, in which we show that, if S,, were not well-ordered by >, 
w would not be securable. 

Accordingly suppose S,, is not well-ordered by >, so there there is an 
infinite descending sequence B(0) > B(1) > B(2) >- - - within S,,. Of course 
then Seq(w) and w is unsecured. Write w—y¥(x) (then r=—lh(r)). 

Now we shall infer by induction on ¢ that, for all sufficiently great m 
(say m= mz), lh(B(m)) and (B(m)),—1,: are 


fixed in value (call their values then a#(0),- - -,a(t), respectively). Let 


*: That 8, is well-ordered by + if w is securable was argued in substance by 
Brouwer [1] Section 1 or [2] Section 2. For w securable, S8,, consists of w and, if w 
is unsecured, those numbers w*&(t) coming below w in the lexicographic ordering + 
which under the inductive definition of securability must be recognized as securable 
before w itself can be so recognized. 
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nj =m, if t>0 if ¢=0), so that, using the hypothesis of the 
induction, for all m=, B(m) represents a sequence the first «+ ¢ values 
of which are fixed as y(0),- - -,y(w—1),a(0),---,a(¢—1). Then by 
the lexicographic ordering, for m >, B(m) must represent a sequence 
of length > and with increasing m (m > (B(m) is mono- 
tone non-increasing. But since (8(m)),,4——1 is a natural number, starting 


with its value for m—n,-+1, it can decrease only a finite number of times, 
so beginning with a certain value m; (> of m, remains 
fixed, call its value then a(t). 


We can express « in terms of 8 and w by the formula * 
a(t) (es(m )ma(s)ol (B(m) — 1 (s),] 


So a is arithmetical uniformly in £.** 

Since B(m;) is of the form w*a(t-+ 1)#*v, and B(m:) eS», w*a(t) and 
w are unsecured ; a fortiori, R(a, w*a(t)). 

Thus (Ha) (t)R(a, w*a(t)), whence (a) (Ht) R(a, w*a(t)), besides which 
w is unsecured ; so w is unsecurable. 
So we have another proof of (F). In the course of this we have shown: 


(H) For each w, there is a function a arithmetical uniformly in B such 
that, if B(0),8(1),B(2),: -e Se and B(0) >B(1) > B(2) >: then w 


is unsecured and (t)R(a,w*a(t) ). 


These results, of course, hold with a,,- - -,d, (for any fixed n=O) in 
place of a. Now we give an application with w—1 and n=0. 

Elsewhere we have shown that there is a primitive recursive predicate 
R(v) such that (a) (#«)(x)R(a&(z)), but (b) for no arithmetical a, (x)R(a(x)).**4 
Consider the set 9”, or briefly S,. By (a), (@)(Bx)R(G(2)), i.e., 1 is not 
securable, so by (G), S, is not well-ordered by >, so there is a B such that 
B(0), B(1): B(2).: - -e S, and B(0) + B(1) > B(2) >: --. But were there 
such a 8 which was arithmetical, the corresponding a of (H) would be an 
arithmetical function such that (2)R(a@(x)), contradicting (b). 

The set S, is primitive recursive and infinite, and 0¢9, but 1e€S;. 


8° As in [7] or [11] p. 317 (following Gédel) esA(s) = {the least s such that A(s)} 
if (Hs)A(s), = 0 otherwise. 

83 [11] pp. 239, 291, 292, or [12] Section 2. A function @ is arithmetical (arith- 
metical in 8), if its representing predicate «(a#) = w is arithmetical (arithmetical in 
B). Cf. [11] pp. 285, 318. 

84 [12] Corollary to Theorem 7. We are rewriting the (#)R(a,w#) there as 
(w)R(&(w)) (cf. beginning 24 above and [11] p. 228 #D). 
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Let o(0) =1, if n+1e8,, and o(n+1)=o(n) if 
n+1¢8,. Then o is primitive recursive, and o(0),0(1),0(2),° is an 
enumeration of S,; with only consecutive repetitions. Now define 


i j = [o(t) &o(i) >o(J)] V [o(t) = > 


Then 1 >} is a primitive recursive linear ordering of the natural numbers, 
and members of S, are ordered by >” as they are by >. 

So for any infinite descending sequence B(0) > B(1) > B(2) >: - - within 
S,, B(0) >® B(1) >® B(2) >®- - -. Thus there is an infinite descending 
sequence in the ordering >* of the natural numbers. 

But there is none which is arithmetical. For given any y such that 


y(0) >Fy(1) >F y(2) -, let y(0) =0 and 
+1) = lo(y(t)) Zo(y(¥())) 


and let B(n) =o(y(W(n))). Then £ is general recursive, a fortiori arith- 
metical, in y, and 8(0) >B(1) >B(2) >- - - is an infinite descending sequence 


within S,. Thus we have established: 


(J) There is a_ primitive recursive linear ordering >*® of the 


natural numbers such that, for some number-theoretic function y, 
y(0) >Fy(1) >F y(2) >F. 
bul, for every arithmetical y, (En)[y(n) S® y(n+1) ]. 


In other words, >” is a well-ordering of the natural numbers with 
respect to arithmetical sequences y(0),y(1).y(2),° - >, but not with respect 
to arbitrary sequences.* 

In fact, the primitive recursive R(v) can be chosen so that >* is a 
well-ordering with respect to a much larger class C of sequences y(0),y(1), 
y(2),: : + than the arithmetical sequences, but not with respect to arbitrary 


sequences.*° 

°° G. Kreisel raised (in a different formulation) the question whether this is the 
case in a letter to the author dated December 9, 1952 (in reply to a letter from the 
author dated November 13, 1952, in which the result of [12] which leads here to (J) 
was stated), and also in a communication received by Leon Henkin November 20, 1952. 
Kreisel formulated the question in relation to the formal system Z, of Hilbert-Bernays 
[5] and some w-consistent formal system S of analysis including Z,, while here we have 
considered the matter only informally. The arithmetical functions are however exactly 
those expressible by terms of Z,, under the usual interpretation of the symbolism, and 
our arguments about arbitrary number-theoretic functions can presumably be formalized 
in a formal system S of analysis of a usual sort. 

°° We need only start from Theorems 7 and 9 of [12] (instead of from Corollary 
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27. Reduction of ae @ (amended). Now we begin the proof that 
ae O= (a) (Ey)R(a,a,y) for a suitable primitive recursive R. As before, 
we start from 


(28) acO=aeQ 


(end 12) and (32) (in 18, 20). But now we amend the former treatment 
(cf. 14) so that we shall be able to give (in 28) the necessary argument to 
supplement the application of the reduction technique (cf. 18). 

The inductive definition of Q (in 12) can be restated as follows (cf. 
beginning 14"). 

If a=1, then aceQ. If a=2%& then 
0,3. If 


a= 3-5: & (ao) (Hyo)T ((@)2, 0; Yo) 
& (#1) [T((@) 2, (%o) 0, #1) > U €Q] 
& (Xo) (41) [T ( (a) 2, (20) 05 
& T( (a) 2 + 1)0,%2) > V(U (#1), U (a2), 91) 
then ae Q,4. aeQ only as required by Q,1-Q,3. 
3y writing a corresponding equivalence (as we did for various inductive 
definitions in the first paper; cf. 18 here), and then substituting in it for 
“aeQ” the predicate expression “(«)(Hy)R(a, &(y))” where R(a,v) remains 
to be selected, we obtain 
(a)(Ey)R(a, &(y)) =a=1V {a = 2 & (a)(Hy)R((a)o, &(y))} 
V {a= & ((a)2, (Xoo, Yo) & (41) T ((a)2, 1) 
(a)( Ey) (a), | & (0)o5 #1) 
& (Xo + 1)0, V(U (21), U (xe), y1)] }- 
We shall presently define R(a,v) so that it will be primitive recursive and 
27.1) will be true. When this has been done, it will follow from (27.1) 


3 by an induction of form corresponding to the inductive definition @Q,1-Q,4 
B (cf. 18) that 


(27.2) aeQ— (a) (Ey)R(a, a(y)). 


to Theorem 7) to obtain as C, for any fixed yeO, the sequences recursive in the 
predicate H, of [12] Section 6; we apply Theorem 7 to H,, 2, to allow for the definition 
of « from 8 for (H). In [13] XXVI we obtain (with the aid of Theorem II below) 
- improvement over [12] Theorem 7, using which C can be the sequences recursive 
in H, for any ye 
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Note that, if C and D are independent of a, 

CV D> {(a)(C&A(a)) V (a) (DE&B(a)) 
= (a)[(C&A(a)) V(D&B(a))]}. 

To find a primitive recursive R(a,v) satisfying (27.1), we begin by 
observing that in the right member of (27.1) (call it (a)) the quantifiers 
can be advanced (using (27.3) to bring both («@)’s across the second V as 
one (a))** to give an equivalent of the form 
(b) MP (a, xo, Yo, Ys) 
where M¥(a, &(Y), Yo, ¥1) is exactly (a) with its quantifiers omitted. 
Next (b) is equivalent to** 


(c) (a) (Hy) (a, 
{Ata (t+ 3)} ((y=-3)2), (Y= 3)o, (y+ 3)1) KY 3}. 


But y=Ilh(a@(y)), and for y=83, 
a(0) = (4(y))o 1, a(1) = (@(y)): = 1, = (@(y))2 


and (since y=3&i<(y— 3). 


{Ata(t + 3)} ((y=3)-) exp(@(y) )ixs, 


using which (c) takes the form («)(Hy)N*¥(a,@(y)) (call it (d)). Now 
we propose to define R by 
(27.4) R(a,v) =N*¥(a,v). 
This does define a predicate R(a,v) by course-of-values recursion on v, since 
for lh(v) <3 it makes R(a,v) false outright, while for lh(v) =3 it makes 
R(a,v) depend primitive recursively on R(b,w) only for 

w= prexp <v. 

i<(lh(v)*3)2 

The predicate R(a,v) thus defined is primitive recursive.*® Substituting into 
(27.4) and quantifying, 


(27.5) (a) (Hy) R(a, &(y)) = (a) (Ly) N* (a, a(y)), 


87 See 14 p. 54 for justification of the advances intuitionistically. 

88 By [12] Footnote 7, and a modification of 5 (16) 41 [9] (15) or [11] p. 285 (17). 
Following Church, dif (i) is f(t) as a function of i (ef. [11] p. 34). 

%° One can eliminate both the course-of-values character of the recursion on v and 
the substitution for the parameter a by the methods of Péter [17] Sections 1 and 2 or 
[18] Sections 3 and 5. Actually by beginning 24 it would suffice for our results te 
observe merely that R(a,v) is general recursive, which is obvious. 
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whence by reversing the steps from (a) to (d) we now prove (27.1) and 
thence (27.2) for the R defined by (27.4). 


28. Supplementary argument to the reduction of aceQ@. We prove 


that 
(28.1) (a) (Hy) R(a, a(y)) ~aeQ, 


conversely to (27.2). So assume for a given a that (a) (Hy)R(a,a(y)) for 
the R defined by (27.4). Then by 24, 1 is securable with respect to (R# and) 
a; and by 26 (F), the set S*, or briefly S*, is well-ordered by >, say with 
the ordinal number | S*,| (=| é(a,1)|+1). We use transfinite induction 
on this number.*® By our assumption and (27.1), one of three cases applies. 


1. a=1. Then aeQ, by Q11. 


2. 20 & (a) (Hy) R((a)0, &(y)). By 24, 1 is securable with respect 
to (@)o. For any 2,2, 22,4, let y* be the least y such that R((@)o,4(y)), 
so &(y*) is immediately secured with respect to (a) 9; and consider some 
function with B(3 + y*) = where abbre- 
viates For z <3, R(a,B(z)) immediately by the definition 
of R(a,v), and for 83=2<3-+y* because R(a,B(z)) would imply 
R((a)o,%((2—3)2)) with (z—3).< y* contradicting the choice of y*; 
but we can choose z=3-+ y* so that (z—-3).—y*, and then R(a, B(2)). 
Thus there is a w such that [2o, 21, 72]*%(y*)*wu is immediately secured with 
respect to (namely B(u2R (a, B(z)))/B(3 + y*)). So Meludes 
all the numbers [2,2,,%2]*w for weS»,, hence | 8%tx,0,.09) | = | S» |. 
Moreover, under the ordering >, S"f2,2,; is the sum of {[2,2,]} and the 
sets Lor *,2,1,0, hence | | | 8, |-+1; ete. So 
finally | | = o(o( | S$, | +1) +1) +1> | 8, |. So by the hypothesis 
of the induction, By aeQ. 
3. &@=3-5M2 (19)(Lyo)T ((a)2, Yo) & (Xo)(41) [ 2, 1) 

— (a)(Hy)R(U(2,), &y)) | & (xo) (x1) (x2) [ T((a)2, £1) 

& T((a)s, (%o + 1)o, £2) > V(U (21), U (#2), y1) J. 
In particular, for each a and x, such that T((a@)s,(%o)o.#:), 1 is securable 


with respect to U(a,). For any such 2,2, fixed for the moment, and any 
,4, let y* be the least y such that R(U(2,), a(y)), so &(y*) is immediately 


*° There is a longer proof, which avoids the use of the classical second number class, 
and uses instead (E) and an induction corresponding to the inductive definition of 
securability. 
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secured with respect to U(2,) ; and consider some function B with B(3 + y*) 
= 21, 22]*a(y*). For z <3, R(a, B(z)) obviously, and for 3=2z<3-+ 
because R(a,8(z)) would imply R(U(2;), with 
contradicting the choice of y*; but we can choose z= 3 + y* so that (23), 
= y", T((a)o, (Xo)o; (z—3)o) and 

T ((@)2, + 1)0,%2) > V(U (a1), U (a2), (2 —3)1), 
and then R(a, B(z)). Thus there is a u such that [2o, 2, 2]*&(y*)*u is imme- 
diately secured with respect to a. Hence | 8% | | SY), | +3 > | |, 
So applying the hypothesis of the induction, 

(2) (21) [T( (a) >, (2) O- U 

By ae Q.* 


29. Classification of ac O and a <, b from above. 


TuHeorEM II. For certain primitive recursive predicates R and S, 
ae O=(a)(Ly)R(a,a,y) and a<ob= (a) (£Ly)8(a, b, a, y). 


Proof. Letting R(a, a, y)=R(a, a(y)) for the R(a,v) defined by (27.4), 
and combining (28) (12 or 27) with (27.2) and (28.1), 


(29.1) ae O= (a) (Ly)R(a,4,y). 

By (IV) (12 or 20) with (21.1), 

(29. 2) a<ob=beO0& (Hr) V(a,b~,z), 

which with (29.1) by advancing and contracting quantifiers gives 


(29.3) <o b= (a) (Ey) [R(b, (y)o) & V (a,b, 


30. The predicates ac O" and a <,’ b. The definitions of the predi- 
cates ae O and a<ob and the representation of ordinals can be relativized 
by using recursiveness in a given predicate P(a) in place of recursiveness.*” 
To do this, we simply read throughout the first three and last paragraphs 
“| |”, “w,”, respectively.** The resulting notions O?, <9?, | |?, (e.g., 

“ By this proof and 27, aeO—> | 8*,| >| a|. 

indeed in any given list of functions and predicates of m,.- 
variables, respectively. 

48 For T,?, ,?, or what is the same 7,7, ®,7 where 7 is the representing function 
of P, ef. [11] pp. 227, 292, 341. 
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for P(a) =aeQO) are discussed in [12] 6.4 ff. and [13] Section 7 (also 
cf. [14] end 3.5). 

The above arguments carry over to these relativized notions. For the 
most part, the changes required in the text are only the supplying of super- 
scripts “?” on the symbols for various functions, predicates and sets, and 
the reading of “recursive in P” (which can often be “recursive uniformly 


in P”) in place of “recursive” 

Thus in 20 and 21, then 7”, V?, enm? and.m <,’’n are primitive, and 
®? and m<,/n partial, recursive uniformly in P (but no, Fin(6) and 
nat(b) do not change). In 22 and 23 we have a function + 91 and opera- 
tion S'o',** which are still primitive recursive (in Case 3 for (22.1), day 
defines recursively a@+0'®?(y,no) as a function of no uniformly from P; 
6? (2,4, = (Z,a, BP (y,n)); p is a uniform Gdédel number from P of 
6? (2, a,y,) ; in (22.2) 8,3 becomes In 24, using (Hx) RP (a, a, x) 
= (7(x), a(x), f,a) for the representing function of P, any 
predicate of the form (Hxr)R?(a,a,x) with R? general recursive in P is 
expressible as (Hx)R?(a,a(x)) with an RP primitive recursive in P (uni- 
formly so, if R?(a,a,x) was); and the definitions of ‘secured’ etc. are con- 
sidered with respect to RP and a (but Seq(w) does not change). In 25 
Theorem I becomes: Given any predicate R?(a,a,x) general recursive in P 
(uniformly in P), a function €?(a) primitive recursive in P (uniformly in P) 
can be found such that (a)(Ex)RP(a,a,7)=&(a)eO?. In 26 > is 
unchanged ; and replacing R by FR? in our arguments, (J) gives a linear 
ordering > “”, primitive recursive in P, which is a well-ordering with respect 
to sequences y(0),y(1),y(2),° > - arithmetical in P, but not with respect 
to arbitrary sequences. In 27-29, starting from ae OP =aeQ?, we obtain as 
the relative version of Theorem IT: For certain predicates R? and S? primitive 


recursive untformly in P, 


ac OP = (a) (Fy) RP? and a <o? b= (a) (By)S? (a,b, a, y). 
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**The superscript “+” is for the number 1 of the arguments of P; were we rela- 
tivizing to Y we would have + 
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LIE GROUPS AND LIE HYPERALGEBRAS OVER A FIELD 
OF CHARACTERISTIC p > 0 (IV).* 


By JEAN DIEUDONNE. 


1. Introduction. In this paper, we develop to their full generality the 
methods introduced in [4] and [5], which enable us to round off, in a way, 
the theory of abelian Lie groups over a field K. More precisely, we show that 
this theory is entirely equivalent to the study of finitely generated modules of 
a certain type over a certain ring 2 and of their homomorphisms (Theorems 3 
and 5). If, for simplicity’s sake, we suppose K perfect, then % is defined 
in the following way: let # be the ring of integers in the complete unramified 
p-adic field & having K as its field of residues; then & is the ring of non 
commutative power series ro+Tr,+-:--+T7"%r,+:--, where r,eR and 
rT =Tr*¢ for every re, o being the automorphism of which corresponds 
to the automorphism ¢—£? of K in the usual way. The ring % is not a 
principal (noncommutative) ideal ring, and this accounts for the fact that 
the theory cannot yet be considered as complete, there being, as far as I know, 
no satisfactory theory of finitely generated modules over such a ring; but 
at least the kind of difficulties which lie in the way is now clearly delimited. 

The main technical tools, as in [5], are the hyperexponential group and 
direct products of finite numbers of such groups; their role as “universal 
groups” stems from the fundamental fact that their hyperalgebra is free (in 
other words, it is an algebra of polynomials). For technical reasons, it is 
sometimes easier to work with the (additive) Witt group, which is isomorphic 
to the hyperexponential group, but is more convenient for some computations. 
It may be hoped that these tools will also one day be useful in the, apparently 
much more difficult, study of noncommutative formal Lie groups over a field 
of characteristic p. 


2. The ring of endomorphisms of the Witt group. Owing to the close 
relationship between the (additive) Witt group and p-adic rings, it will be 
technically more convenient to determine first the endomorphisms of the Witt 
group; the determination of the endomorphisms of any group of hyper- 
exponential type will then be quite trivial, since such a group is isomorphic 
to the Witt group [5, no. 7]. 


* Received January 20, 1955. 
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We do not repeat here the definition of the additive Witt group Ji 
(see for instance [5, no. 7], or [6, no. 2]); it is a recursive group over the 
weighted variables x; (120), 2; having the weight p‘. We shall write 
= ($0, * * *) its group law, ¢;(x,y) being an isobaric polynomial 
of total weight p* in -,%i, Yo," More precisely, one has 

pi (X,Y) =i t+ Yi + Vi (Los Yor 
and the coefficients of y; are in the prime field F,. 

We can of course consider W as a group over any field K of characteristic 
p; but the notion of endomorphism of W depends on K, and we will say that 
= (Uo, * *) is a K-endomorphism of W if the coefficients of the 
formal power series wu; are in K. We will write €(K), or simply €, for the 
set of K-endomorphisms of W which are recursive, that is, such that u;(x) 
only contains the indeterminates 7, on the other hand, we depart 
here from the conventions made in [5, no. 2] in that we do not assume that 
the minimum weight of u; tends to +c with 1; this can be done here, due to 
the fact that ¢; and u; contain a finite number of indeterminates. The set 
€(K) can then be made into a ring in the usual fashion, wv meaning the 
composite endomorphism u(v(x)), and w+ v the endomorphism w such 
that w(x) is obtained by taking the “product” in W of u(x) and v(x); 
in other words, we have 


3. In order to study the structure of the ring €(K), we first consider 
certain special types of endomorphisms of W: 

1. The “Frobenius homomorphism” p, such that 
(1) pi(x) =a? 

The “shift” homomorphism f, such that 
to (x) =0, t;(x) = 2, 

It is well-known [7] that the product 
(3) 
is the “ p-th-power endomorphism” of the group W, and we have 
(4) =0, (%) (t==1,2,---). 


3. Let (K) be the ring of Witt-vectors A = (do,a,,: over 
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K; it is a commutative ring of characteristic 0 without zero divisors, with 
unit J=(1,0,---,0,---) [7]. To each such vector A corresponds an 


endomorphism 
(5) 


of the additive group W, the “ product” being taken in the sense of the Witt 
multiplicative group; y4,—=(A-x); is an isobaric polynomial in %,- -, 2, 
of weight p‘, with coefficients in K. If in y we substitute 1 for zo, 0 for all z, 
of index 7=1, we obtain the 7-th component a; of A; therefore there is a 
one-to-one correspondence between the Witt-vectors A and the endomorphisms 
(5) they define in W, and this correspondence is in fact a ring-isomorphism 
between ¥8(K) and the corresponding subring of €(K). We shall from now 
on identify these two rings. The endomorphism # is then identified with 
the element p-J = (0,1,0,---,0,---) of W(K); W(K) is a complete 
valuation ring, the valuation w(A) being the smallest index 7 such that a; 0; 
the field of residues is K, and when K is perfect, Y%(K) is isomorphic to the 
unique unramified complete valuation ring having K as its field of residues 
[v]. We define a ring-endomorphism o of W(K) by putting 


With this definition, it is readily verified that we have the commutation laws 
(7) pA = A’p, At=tA’. 
We also notice that the series 
Apt 


represent meaningful endomorphisms, when the A; and B, are arbitrary 
elements of the ring Y(K); indeed, the j-th component of (A;p*)-x is an 
isobaric polynomial in %,- - -,2;, of weight p/**; it follows that, if s, is the 
sum of the first & terms of (8), the terms of weight < p* in any component 
of s,(*) are the same as in the component of s;,,(%) of the same index. On 
the other hand, any component of (#*B;,)-x of index =k is 0; therefore, 
if uw, is the sum of the first k terms of (9), the components of u;,(x) of 
index =k are the same as those of wx,,(%), and this proves our assertion. 
It follows from (7) that the series (8) (resp. (9)) is invertible in €(K) 
if and only if A, (resp. By) is invertible in ¥%(K), which means that 
w(Ao) 0, or equivalently that if Ao (doo,do1,° We have 
doo 540 in K. The inverse is computed in the usual way, as another power 
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series of type (8), whose coefficients are determined recursively by those of (8). 
The results are similar for (9). 


4. We can now characterize the elements of €(K): 


THrorEM 1. If K is perfect, every endomorphism u::E€(K) can be 


written in one and only one way in the form 
(10) > + DUB, 
k=0 k=1 


where the A; and B,, are elements of the ring B(K). 


Writing that wu is an endomorphism, we obtain first the relation 
Uo (Lo + Yo) = Uo(Lo) + Uo(Yo) 
from which the form of wu, follows trivially: 


Let A;= (a,0,0,---) for each «, and consider the endomorphism 
it is clear that wo(%) = and 
therefore the endomorphism w= u— is such that wo(xz) = 0. 

As a basis for an inductive argument, let us now suppose that we have 
u;(x) =0 for 7 <1, and let us determine u;(x). We must have 


(11) U; (Zo, i + ui (Yo, Yi) 
Ui (Lo + Yo, $1(x,y), $i (x, y) ). 

On the other hand, we have #uw—u, and this, using the inductive 
hypothesis and relations (4), gives 
(12) uj (0, +, = 0, 
in other words 
(13) u;(0, 2, = 0. 
Therefore every monomial in wu; contains 2); let us show that uw, does not 


contain Indeed, suppose u(x) =D and 
r=0 


suppose f is the smallest value of r > 0 for which f, 40; identifying in both 
sides of (11) the terms in 2," gives 


fr( Zo, ° =fn(o + Yo,° +F 


where F is a polynomial in 2, -, each term of which con- 
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tains either y; or an 2 of index k=i—1 (due to the fact that every 
monomial in y; contains at least such an z;,). If we replace %,- - -,%.1 by 0 
in this identity, we get therefore 


where ¢, is the constant term in f,, and this shows that f, is a constant. 
But as we have seen before that each monomial in f;, must contain 2, we have 
f, = 9, hence u; cannot contain 2;. Suppose then that k is the highest index 7 


such that u; contains 2;, and let again u;(%) = >) SUppose 
r=0 


k > 0, and that h is the smallest value of r > 0 for which g, 0. Identifying 
on both sides of (11) the terms in 2;”, we see by the same argument as above 
that g, Equation (11) reduces therefore to + Yo) = Ui(%o) + Us(Yo), 
hence 


ui (x) — CoLy + + + + 
Let (¢,0,0,- - -) in YW(K), and consider the endomorphism 


(14) 


It is clear that the j-th components of u(x) and v;,(%) are the same for 
jst (and 0 for 7<1). The inductive argument can then be carried on in 
an obvious fashion, and shows that 


the infinite sum having a meaning in €(K). Therefore, we see that we can 
write u== > t*Cjp/, where CyeX(K). If we now assume that K is perfect, 
ij 


we have 
tC yp) = if 


jp) = ‘nt if 


from which (using the fact that Y%(K) is complete) it is easy to deduce 
that w has the form (10). 

To prove uniqueness, suppose w is equal to the right-hand side of (10) 
and is 0; we have to show that all the A; and B, are 0. Suppose the contrary, 
and let us prove first that it is impossible that w(A;,) =O or o(B,) =0 for 
some k. Indeed, suppose first »(A;) =0 for some A;,0, and let h be the 
smallest of the indices & having that property ; then if An = (dno, 
and the component of index 0 of u(x) would be ajox%”-+- - -, the terms 
omitted having degree > p", contrary to assumption. If on the other hand 
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w(B,) 0 for some B,~0, let again h=1 be the smallest index having 
that property ; then, in the component of u(x) of index h, there would be only 
one term bjt) of weight 0, with ba,40, which is again contrary to the 
assumption u—0. Suppose now we have proved that the minimum value 
of the w(A;,) and (B,) (for the non zero coefficients) cannot be <j, and 
let us prove it cannot be equal to 7. Let us write A,—A;’*, B,—B,’%, 
where the minimum value of the w(A;’) and w(B;’) is therefore 7—1. 
As = =tp, and the relation wp —0 implies w 0, we have 


k=2 


k=1 
This already implies contradiction if w(Ao’) or one of the w(B,’) is 7—1; 
if not, we can again divide by p on the right and after a finite number of 
steps, we reach the desired contradiction. 
Theorem 1 is thus completely proved. In what follows, we shall use only 
the endomorphisms of type 


(15) 


which (without any assumption on KK’) constitute a subring €*(K) of €(K).. 
The preceding argument shows that an endomorphism of €*(K) has only one 
development (15). 


5. Abelian groups associated with endomorphisms of H*. There exists 
a recursive isomorphism h of the additive Witt group W on the hyper- 
exponential group H, such that h;(x) is an isobaric polynomial x;-+-- - - of 
weight p* for each ho(x) hi(x) ([5, no. 7], and [6]). 
Any K-endomorphism of H (in the same sense as in no. 2) can therefore be 
written huh, where ue€(K). In what follows, we will identify u and 
huh ; this will certainly lead to no confusion regarding the endomorphisms t 
and p, for hph- and hth" are respectively the Frobenius homomorphism and 
the shift homomorphism in the group H. 

Consider now the endomorphisms of the group H", direct product of n 
groups isomorphic to H; we will denote by x) = - -,a,{,- - -) 
(1StSn) the systems of indeterminates corresponding to each of the 
factors. A recursive endomorphism u of H* will therefore consist in n 
systems (uo(,- - -,u,,- - -) of power series in the (1 Si<n), 
such that contains only the such that and1Sjn). The 
standard argument determining the endomorphisms of a module which is a 
direct sum of n isomorphic submodules proves in addition that the endo- 
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morphisms u correspond, in a one-to-one fashion, to square matrices (f)) of 
order n, where the f¢) (1Si<n,1j<S7) are elements of the ring of 
endomorphisms of H (identified to €(K)). More precisely, one can write 
symbolically 
(16) uli) +, => ft (x) 
j=1 

the “sum” in the right hand side being understood as taken for the group 
law of H. 

From now on until no. 10, ¢= (¢p)a-o,1-- Will always designate the 
group law of H, so that, with the notations of [5, no. 5], we have 


Yor’ Yn) t+ Yat > Ey, (x) 
1<k<ph 


where H;, is an isobaric polynomial of weight k. 


6. With the preceding notations, we now suppose given an endomorphism 
u of H" such that the endomorphisms f“# of H belong to the ring €*(K) 
(no. 4); owing to (16), this implies that in the power series uw‘, all 
monomials have a weight <p", and in particular the terms containing the 
indeterminates x, (1=j=7n) are of the first (total) degree. Generalizing 
the method introduced in [5, no. 9], we are going to define, with the help 
of u, the hyperalgebra of an abelian group of dimension n (over K). 

In order to do this, we consider the algebra P of polynomials with 
respect to sequences of indeterminates =0,1,- -), having 
coefficients in K, and in this algebra, we consider the ideal a generated by 
the polynomials 


For each index a= (@,:--,@,) (a integers =), let 
Let Xn; be the class of 8); in the quotient algebra P/a, and 


THEOREM 2. The elements Z, form a basis of the algebra P/a, such 
that, with this basis, P/a is the hyperalgebra of an abelian group G of 
dimension n. 
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It is clear that the algebra P/a is generated by the X,;, and more precisely, 
from the relations 


it follows, by an easy inductive argument on h, that the elements of P/a 
are linear combinations of monomials X, in the X,, where the exponent of 
each X,; is <p; moreover, relations (19) also show that the linear com- 
binations of the X, of height h(a) <7 constitute a subalgebra §, of P/a. 
From the definition (18), it follows that the Z, belong to §, for h(a) <r; 
we are going to prove that the Z, of height h(a) <r constitute a basis of 3,; 
the family (Z,.) (without restriction on 2) will then of course constitute a 
basis of P/a. 
The first step consists in proving that the 


h 
(where > v,ip* is the p-adic development of and h <1) constitute a 
k=0 


basis for $,. This will be proved by induction on 1, as a consequence of the 
fact that all the monomials, in the polynomial u,“, have a weight < p*, and 
of the elementary lemma: 


Lemma 1. Let A be a commutative ring with unit element, and let b 
be the ideal in the ring of polynomials A[Y¥.,---,¥n], generated by the 
polynomials 


Fi" — (l1SisSn) 
=1 


m being an integer >1, the bi; and c; elements of A. Then if Y;, is the 
class of Y;mod b, the monomials - with0 Sw; < m, constitute 
a basis of the quotient ring A[Y,,- - -,¥n]/b6 over the ring A. 


To prove the lemma, let us assign weight 1 to the Y;, weight m to each 
polynomial Q;. Then it is readily seen by induction on & that every monomial 
in Y,,- - -,¥Yn, of total degree =k, can be expressed as a linear combination 
of monomials in Y;,- - -,¥n,Q:,- of total weight =k, the exponent 
of each Y; in these monomials being <m. As the number of these last 
monomials of total weight = is exactly equal to the number of monomials 
in Y,,---,Y, of total degree =k, they constitute a basis for the set of 
polynomials in Y,,---,¥Y, of total degree =k; the lemma follows imme- 
diately from this result. 

The fact that the X, with h(a) <r constitute a basis for 8, is then 
proved by applying the lemma in succession to A= K, - A 
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7. Next we remark (see [5, no. 8]) that if rnp" 
(p-adic development, v, >0), there is in the monomial 
> 
om! 
inverse lexicographic order, the preceding monomial is the “first” i L;. 


moreover, if we order the monomials in in the 


Let us next order the S); in a linear sequence: 


If we put a;—= Sv,p! (p-adic development), the preceding remark shows that 
i 
l 
if we order the monomials in the S,; in the inverse lexicographic order, the 


“first”? monomial in L, is the monomial — Sq (with notation similar to 
a! 


that of the X,). Consider now the expression of Z, as a linear combination 
of the monomials .Y), and let us order these monomials in the same order as 
the corresponding monomials in the S;;. We observe that if in a monomia! 
in Hy, there is an exponent pz; = p, then there must be an 
index 1 > k such that we will have «i; < vj, owing to the fact that the L;, are 
isobaric; considering the largest index k for which there is a j such that 
uj = p, we see therefore that in the expression of the corresponding monomial 
every monomial is 


as a linear combination of the X 


afler the monomial —— V4 (which itself comes from the replacement of the Sy; 
a! 


by the X,; in— S8,). due to relations (19). In other words, in the expression 
a! 


of Z, as a linear combination of the 1), the “ first’ term is = Ag From the 
a! 


linear independence of the X,, it follows at once that the Z, are linearly 
independent over K. 

This result ends our proof of the fact that the Z, with h(a) <r con- 
stitute a basis for $,, for their number is the same as that of the X, with 
h(a) <r. 


8. We can now write 


>, depyZ, (dap, eK) 
and by definition, these relations can be interpreted as polynomial congruences 


(21) Kp E, (mod. a) 


in the ring ?. To prove Theorem 2. we apply the existence theorem [5, 
Prop. 1]; replace in P and in the polynomials (17) the indeterminates Sj; 


3 


4 
43% 
| 
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by new indeterminates 7),;, obtaining thus a ring P, of polynomials and an 
ideal a, in P;. and let Y;; be the class of 7),,mod.a,. We then form the 
elements 


and we have to prove that these elements satisfy the same relations (20) as 
the Z,. But from the expression of the ¢; in terms of the ;, and the definition 


of the hyperexponential series, it follows that we have [5. no. 8| 


where 


Now, if in the polynomial congruences (21) we replace each 8); by ®ni, we 
see that the difference 

07 ) ( 


is a linear combination (with coefficients which are polynomials in the )j, 

and therefore in the X;; and Y;;) of the polynomials 

and therefore our proof will be ended if we establish that the expressions (23) 

all vanish. But as the ¢, have their coefficients in the prime field F,, we have 
By? = (Xoi?, +, Yai?) 

and if we equal the expression (23) to 0. we obtain the relation 


dn (X oi?. Y Y ni?) 


But if we replace the 1},” by their expressions (19), and the 17,” by the 
corresponding expressions, the relations we obtain follow from the assump- 
tion that wu is an endomorphism of the group H". The proof of Theorem 2 


is thus complete. 


9. We will say that the abelian group G. the existence of which follows 
trom Theorem 2. is associated to the endomorphism u of H". Our proof 
gives an “explicit” determination of that group when w is known; more 
precisely, the translation operator in @ is given by 


i=1 


LIE GROUPS OVER A FIELD OF CHARACTERISTIC p > 0 (IV). 439 


where in the development of the right-hand side, the powers of the 1); must 
be reduced to exponents < p according to the relations (19). Moreover, the 
algebra P can be identified with the hyperalgebra of the product U” of n 
groups isomorphic with the “universal group” U [5, no. 8], the Sy; being 
the invariant semi-derivations which generate that hyperalgebra; the proof 
of Theorem 2 shows then [5, Prop. 2] that the natural mapping of P onto 
P/a is the derived homomorphism v’ of a homomorphism v of U™” onto G. 


10. Some examples. It is easy to reformulate in terms of matrices 
(with elements in €*(K)) the definitions of the particular groups considered 
in [4] and [5]. For instance, to the null matrix (of order n) corresponds 
the direct product W," of n additive groups; to the unit matrix, the direct 
product (W,*)" of n multiplicative groups. The group G,,»,, defined in 
[5, no. 9] corresponds to the matrix 


0 


0 
0 
where in the last row, the element #” is in the (m+ 1)-st column. 
Similarly, the “streak”, “tree” and “braid” types of groups of dimen- 


sion 2 defined in [5, no. 10], correspond respectively to the matrices 
0 Ff 0 
0 07 0 ey 
11. The structure of abelian formal Lie groups. The construction of 


abelian Lie groups given by Theorem 2 is in fact general enough to give all 


abelian Lie groups, up to an isomorphism; in other words: 


THEOREM 3. Lvery abelian formal Lie group G of dimension n over a 
field K is isomorphic to an abelian group associated to an endomorphism of H". 

We start by remarking that the invariant derivations Y,; (1 Sin) 
of G satisfy relations of the form 


n 
j=1 


We consider the Witt vectors A ij = (a4ij,0,0,- - -), and the endomorphism wu 
of H” defined by the matrix (Aj;) (whose elements are identified with endo- 
morphisms of H, as explained in no. 5). Let L® be the abelian group asso- 


) 
e 
9 
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ciated to uw); then the invariant derivations (1 of satisfy 


the same relations 
44 
o? = Dd oj 
j=l 


(cf. relations (19) ). 

As a basis for an inductive argument. we will next suppose that there 
is an abelian group L, associated with an endomorphism u™ of H". such 
that, if we denote by 1); (resp. X’;;) the generating invariant semi-derivations 


in the hyperalgebra of G (resp. L'”). the following properties hold: 


1. if Z, (resp. Z’q) is the coefficient of y, in the translation operator 
of (resp. L®™). and if =D (eave K), then, for all indices of 
height h(a) <r, we have 7’, = Dd egX"); 
of G and L™ are identical: in other words, if Z7,Zg—= > dog,Z,, we have 
> dapyZ’y for <r and h(B) <r; 


2. the multiplication tables of the subalgebras $,, 9’, of the hyperalgebras 


3. if we have 


then also 


n 
A = >. rj bi,Z’y 
j=1 r 


(the sum extended over indices \ of height h(A) <r). 

Let (¢i):<is, and (¢’;):<i<, be the series defining the group laws of G 
and L® respectively; it follows from assumption 2. that for h(a) <7 and 
h(B) <r, the coefficients of rgyg in ¢; and in ¢’; are the same [4, formula (5)]. 
Let v (resp: w) be the isomorphism of G (resp. L™) on a group G (resp. ZL”) 
obtained by the standard process described in [2] (see also [4, no. 4]), such 
that the group laws of G and LZ” are canonical; we have v;(x) =.2; +: 
(resp. wj(x) =2;-+- - -) for every i. the unwritten terms having total degree 
=. Moreover, the description of the standard process given in [2]. and the 
fundamental fact that (with the notations of [1]) no power series P,(x) 
except the P y-., can contain monomials in 2” [1, formula (34)], show 
(together with the preceding remarks) that the coefficients of 7,. for h(a) <1’. 
as well as the coefficients of the monomials xj", are the same in vy; and w; 
(1St=n). It is then clear that properties 1. and 2. still hold when G and 


LI are respectively replaced by G and L™; but 3. also holds. for we have 


(lSisn) 
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v (Xi) + Dd gaXy and w’ = Li + ginX’y with the same coeffi- 


cients (summation over the indices of height h(A) <1), as follows from the 
preceding argument. 

Now we apply the uniqueness theorem [2, Th. 2], or rather the corre- 
sponding result which is obtained by “cutting down” the power series 
“modulo the monomials 2,?"*'”, and which obviously follows from the proof 
of the uniqueness theorem (see [4, no. 4]): as the “constants of structure ” 
Capy in the algebras $,.. and 3’, are the same, the fact that the group laws 
of G and L™ are canonical implies that properties 1. and 2. are valid for @ 
and when the conditions h(a) <r, h(B) <r are replaced by h(x) <<r+1 
and h({p) <cr+1. 

This is of course still true if we apply to both G and L™ the isomorphism 
w. In other words, if we replace G by the group which is the image of G 
under the isomorphism gq) wv, conditions 1. and 2. are now satisfied 
when in the inequalities on the heights, r is replaced by r+-1; and of course, 
condition 3. holds without change. 

For future use, we remark here that the series (#) =a; 

- 1), Where the unwritten terms are of total degree = p’; this follows 


from the identity of the terms of total degree < p” in v; and w;. 
12. Suppose now that the expression of .\’,,,,2 in the hyperalgebra of 
L™ is 


n 
(24) A rt, = > 4i;? A r+1,j + b in 
j= 


where, in the summation, the indices p are such that h(n) <<r+1. We want 


to prove that the differences 


n 
j=l 


are derivations. To do this, we remark [4, Lemma 2] that we have 


On the other hand, we can write 


(26) ice? =D. GixpZ’ p 
p 


Where the summation extends over indices p£0 of height <7r+1. Replacing 


in (25) and comparing with (24), we see that, for || > 1 


k 


a 


i 
if 
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and 


ye = > Jip k, (p 
k 


for any p such that O<p<yp and any J such that 0<1< p’*?; in other 
words, the b’;, for || >1 are entirely determined by the gixp. But from 
our assumptions it follows that we also have 
(28) = > p (1 = <p) 

p 


with the same coefficients. On the other hand, we have by [4, Lemma | 


(29) Xn (fg) =f + 9° 


O<k<p'*! 
Replacing the Zj.,2 by their expressions (28), and using relations (27), we 


see that in the expression of V;(fg) as a sum Shyy(Z f) (Zug), there is no 
hem 
term in which both X and p are 0 and this proves our assertion. 


13. We can therefore write 


Suppose the endomorphism uw” of 1” corresponded to the matrix (f{/-””), where 
fii” belongs to €*(K). Consider the Witt vectors = (a;j*, 0, 0.- 


and the matrix 


This matrix corresponds to an endomorphism u*) of H”, to which in turn 
correspond an abelian group Lt; if we recall the definition of L°*) by 
means of relations (19), we see that we have finally reached the following 
conclusion: the image of G under the isomorphism q, and the group L°"*», 
satisfy conditions 1., 2., 3. where r has been replaced by r-- 1. 

It is now easy to bring our inductive argument to a close. From the 
last remark of no. 11, it follows that the “infinite product ---q™- - -q%q 
is meaningful and represents an isomorphism of G on a group G. On the 
other hand, formulae (30) show that the matrices (f{%”) “converge” to a 
matrix (f‘)) with elements in €*(X’), corresponding to an endomorphism u 
of H". If L is then the abelian group defined by that endomorphism. our 
construction is such that the constants dg, in the hyperalgebras of G and L 
are the same, and therefore these two groups are identical, which concludes 


the proof of Theorem 3. 
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We add two remarks: the first is that, forming (according to Theorem 2) 
abelian groups corresponding to endomorphisms of H”, we can always limit 


ourselves to the case in which the matrix (f‘/) is such that, if fi) = > #A,,, 


k=0 
the Witt vectors 4; which are ~0 are such that o( Ai) =0; this follows 
from (30). 

On the other hand, Theorems 2 and 3 yield, for abelian groups, the 
following result. which is analogous to the well-known theorem in classical 
Lie theory, asserting that one-parameter subgroups “fill up” the group 

locally: for any element X = i (& eK) of the Lie algebra go of an 
4=1 
abelian group (, there exists a homomorphism wu of the hyperexponential 
group H into G such that u’(Soo) = XY (Sy; being the usual generators of 
the Lie hyperalgebra of H, noted Y,; in [5, no. 5]). One has merely to 
make a linear transformation on the indeterminates in G, which transforms 
JV into Vo, for instance; we have then a homomorphism of U" onto G (no. 9), 
and the restriction of that homomorphism to the first factor of the product, 
composed with an isomorphism of H onto U, satisfies the required conditions. 
It would be very interesting to know if this result extends to non-abelian 


formal Lie groups. 


14. Homomorphisms of abelian groups. It follows from theorem 3 
that the study of abelian formal Lie groups can be reduced to that of the 
groups associated with endomorphisms of groups H/"; we are going to study 
the homomorphisms of such groups. 

Let G and G be two such groups, of respective dimensions n and m, and 
suppose they are respectively associated with an endomorphism u of H” and 
an endomorphism v of H”; wu and v can respectively be represented by square 
matrices U = (f))) and V = (g“)) of order n and m respectively, having 
their elements in the ring €*(K). We shall designate by P and Q the 
algebras of polynomials, by a and 6 the ideals in these algebras determined 
respectively by u and v, so that the hyperalgebras of G and G are respectively 
isomorphic to P/a and Q/b (no. 6); S,; and T,; will be the indeterminates 
in P and Q respectively, X;; and Y,; their classes mod.a and mod. 6 
respectively. 

We will first consider the following question. Let w= (w), <<, be a 
homomorphism of H” into H", w® = -) being n systems 
of power series in the 2,4) (1Sj=m) such that w, contains only the 
tx) such that k =h. To this homomorphism is again associated an n X m 


wi we tee 
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matrix W=(h@)) with elements h“) in €(K), 
such that (with the same conventions as in no. 5) 


j=l 


We will always suppose, in addition, that the h‘/ belong to €*(K) (which 
insures that the w, are in fact polynomials). 
We now ask under which conditions on W the formulae 
V i 4 4 


-) 
define the derivative s’ of a homomorphism of @ into G. 


THEOREM 4. Suppose K is a perfect field. In order that formulae (31) 
define a homomorphism s of G into G, tt is necessary and sufficient that there 


exist ann Xm matria L with elements in €*(K), such that 


(32) UW =L(xz-I—t-V) 

(I being the m Xm unit matrix, and W* being the matrix ((h“/))7) the 
isomorphism o being defined by a? = if 


15. The system of polynomials 


can be represented by the “one-column matrix” W-«x, the indeterminates 
x, being replaced by 7T);. If there is a homomorphism s verifying (31), 


we must have 
(33) = (Wh (Yor, om” 5 ni”, » 


(w,)* being the polynomial obtained by raising all the coefficients of w,\' 
to the power p. But considering the definition of G by means of the matrix 
V (no. 6), we see that the right hand sides of (33) are obtained by replacing 
Tn; by Yn; in the system of polynomials represented by the one-column matrix 
(WeV)-x. On the other hand, the definition of G by means of U and 
formulae (31) show that (if s exists) the left hand sides of (33) are 
obtained by replacing T; by ¥,; in the system of polynomials represented by 
the one-column matrix (UW)-x. Therefore, a necessary condition is that 
the matrix €C = UW —W°V be such that, when each 7); is replaced by Y;; 
in the system of polynomials represented by the one-column matrix C- «x, the 


dit 
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expressions obtained all vanish. This can also be expressed in the following 
way: the polynomials of the system represented by C-x must belong to the 
ideal b. 

This is certainly the case if C=L(n-I—t-V), for w-I—t-V 
—t(p-I—V) (from (3)), and relations (19) (or rather the corresponding 
relations for G) show that the polynomials of the system represented by 
(p-1—V)-«x belong to 6; moreover, if y is a one-column matrix representing 
a system of polynomials of 6, M-y is also such a matrix, for an arbitrary 
nv Xm matrix M with elements in €*(K). Let us now prove the converse ; 
we can write where Ty = (C,“%) is an 
nm matrix with elements in the ring W (it) of the Witt-vectors. If we 


denote by +; Tim) the polynomials of the system represented by 
C-x. and if = (40%, +) K), we have 
m 
Co) (7 * l'on) Coo 7 Oj: 
j=l 


Now we have seen in the proof of Theorem 2 (no. 6) that such a polynomial 
cannot belong to 6 unless all the eyo?) 0; but, as K is perfect, this means 
that there is a matrix A, with elements in YW (K), such that We 
can therefore write C= A,(#-I1—t-V) +#-C,, and (#:C,)-x must belong 
to the ideal b; in other words, we are reduced to the case in which Tr, —0. 
More generally, suppose T;, is the first coefficient in C which is #0; we 
then have 


j=l 


and the same argument as before shows that all the elements ¢;9‘/) must be 0. 
By an easy inductive argument, it follows that C—L(x-I—t-V) with 
the A; being nXm matrices with 


elements in YW (A). This concludes the proof of the necessity of condition (32). 


16. To prove that the condition is sufficient, we consider the homo- 
morphism r of the algebra of polynomials P into the algebra of polynomials 
Q, defined by 


-) 


Condition (32) insures that the image of the ideal a under the homo- 
morphism r is contained in b; hence r defines a homomorphism s’ of P/a 


into ()/b, satisfying (31). It remains to be seen that this homomorphism 


2 25 wee 


| 
a | 
aé 
we 
«oe 
a 
ae 
ne 
ad 
rT] 
? 
aw 
y 
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is the derived homomorphism of a homomorphism s of G into G@: this will 
be done by showing that the conditions of [5, Prop. 2] are satisfied. 

Introducing new sequences of indeterminates $’,;. 7%n;, and using the 
notations of no. 6, we first remark that. as r is a homomorphism of 7? into Q, 
the image by r@r of the element 

o=B=a 

is obtained by replacing in that expression each S,; (resp. 8’,;) by its value 
r(S;;:) (resp. r(S’xi)) given by (34). But from the fact that w is a homo- 
morphism and the arguments in no. 8, it follows that we obtain in that way 


the expression 


where 
and 


(¢ = (¢,) now stands, as in no. 8, for the group law of the hyperexponential 
group H). We thus obtain a polynomial in the T”;;, which (by the argu- 


ment of no. 7) can be expressed as a linear combination of the polynomials 


Let us now go over to the quotient algebras P/a and Q/b; let X’n;. Yn 
correspond to S8’y;, T’,; by the natural mapping. and, changing slightly the 


notations of no. 8. let us write 


o=B=a 


Then the image, by s’@s’, of Z,°, is obtained as a linear combination of the 
elements (35) in which 7}; and 7’,; have been respectively replaced by Vi 
and Y’,;; but these elements are merely the elements Z)°, by the argument 
of no. 8. The conditions of [5, Prop. 2] are thus satisfied, and the proof 


of Theorem 4 is brought to an end. 
17. We will say that the homomorphism s of @ into G. defined above, 
is associated to the matrix W. We now prove (with the same notations) : 


THEOREM 5. Suppose K is a perfect field. Every homomorphism s of 
G into G is associated to an n X m matrix W with elements in €*(K). 


| 
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We are going to construct a matrix W for which relations (31) will hold. 
We can write first 
s’(Xoi) == 0; (1 Sisn). 
j=l 


We consider the Witt-vectors Aj; 0,0,---), and the matrix W = (A;;); 


we have therefore 

As a basis for an inductive argument, suppose we have constructed a 
matrix W such that 
(36) s’ => wr) (Yor, Yom Y nm) 
for SisSn and OShSr. As 8’ is a homomorphism of hyperalgebras, 
we have, by (18) 
(37) 8°(Z,) = 8’ * *)- 
We prove that the differences 

are derivations in G (1Sisn). We have by definition 


(s’°@s’) + 8’ OL 
(38) 
= 8° (Z ie; ) 8’ (Z 


p't! 
Now in the sum in the right hand side of this formula, we can replace s’(Zx«,) 
by its expression (37), and as k <p”, only terms s’(X),;) with ASr will 
intervene. These can then be replaced by their expression (36). Now, using 
(22) with a= (and therefore we see that we have 
(39) @ ( Yo; ) + Wri) (Yor, Y ++1,m) 
+ 8’ ) 8'(Z ) 
O<k< 
since defines a homomorphism w of H™ into H”". As computation- 
with tensor products and with “ Leibniz formulae” are “dual” to each other 


(see for instance [4, proof of Lemma 2]), comparison of (38) and (39) shows 
that = f Uig + g° U;f. 


We can therefore write 


(lStSn). 


wt we??? 
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Consider the Witt vectors = (a"*,0,0,---), and the matrix 
Wr?) — Wo + (A, 9). It is then clear that we have the relations 


for 1Sisn and OShSr+t1. The induction is then concluded in the 
usual way, the matrices W “converging” obviously towards a matrix W 
for which relations (31) are verified for every h. Theorem 5 is therefore 


proved. 


18. The result embodied in Theorems 4 and 5 can be put in a much 
more suggestive form. Multiplying on the left both sides of (32) by # gives 
an equivalent condition, since the ring €*(K) has no zero-divisors; using 
the commutation rules (7). we see therefore that (32) is equivalent to the 
relation 
(40) —tU)W (W + tL) (aI, — 


(I, and I, being the unit matrices of order m and n). Conversely, suppose 


there is an » X m matrix W, such that 
(41) —W ,(al,,— ww). 


From such a relation, it follows at once that W and W, are congruent mod. f, 
in other words we can write W,—=W + tL, and we see therefore that (41) 
implies (40), hence also (32). The existence of an n X m matrix W, over 
€*(K) satisfying (41) is therefore (when K is perfect) the necessary and 
sufficient condition for W to represent a homomorphism of @ into G. But 
this can also be formulated in different terms: consider the left-modules 
Y= (E*(K))", F = (E€*(K))”, direct sums of n (resp. m) modules identical 
to €*(K) (considered as a left module over itself). In FE (resp. F’). let WV 
(resp. NV) be the submodule which is the image of / (resp. F’) under the 
endomorphism represented by the matrix #I,,—#tU (resp. —#). Relation 
(41) is then the necessary and sufficient condition for the matrix W to 
represent an (€*(K))-homomorphism of the module F# into the module F, 
which sends ./ into V, and therefore to define a homomorphism of £/M into 
F/N. We thus see that. by associating with the group G the €*(4’)-module 
E/M, the theory of abelian formal Lie groups over a perfect field K 1s 
essentially reduced to the theory of left €*(K)-modules. 


19. Applications: I. Classification of abelian Lie groups. If s is an 
isomorphism of G onto G (for m =n), the matrix (aj;) (with the notations 
of no. 17) must be invertible. and from that remark it is very easy to con- 
elude that both matrices W and W, in (41) must be invertible. Therefore: 


a 
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TrEoreM 6. Let K be a perfect field. In order that two abelian Lie 
groups G, G over K, respectively associated to two square matrices U, V of 


order n, with elements in €*(K), be isomorphic, it is necessary and sufficient 
that —tU and be equivalent. 


In the language of modules, this (with the notations of no. 18) means 
that the modules F/M and E/N associated to G and G are isomorphic. The 
classification of abelian Lie groups over a perfect field AK’ is therefore equi- 
valent to the classification of the €*(A)-modules having a finite system of 
generators to which they correspond; as mentioned in the Introduction, no 
satisfactory theory is known at present for such modules, due to the fact 
that €*(A’) is not a principal ideal ring. 

[It follows of course from (32) that G and G will be isomorphic if there 
exists an invertible matrix W such that WoOUW-! —V, but this condition is 
by no means necessary, as we shall see below, and the problem of classifying 
matrices under this stronger equivalence relation does not appear to be much 
easier, even when KA’ is algebraically closed. When U is invertible and Kk 
algebraically closed. there is always an invertible matrix W such that 
WeUW —I, as was essentially proved in [4, no. 13]; the corresponding 
groups are therefore all isomorphic to (W,*)". A similar proof would show 
that. when K is algebraically closed and U=#'U,, with U, invertible, there 
exists W such that WCUW —wt'l, and the corresponding groups are there- 
fore isomorphic to the direct product (J,)". 


In particular, for the groups of “svmmetrie braid” type [5, no. 10], 


0 


and therefore (as announced in [5]) if K is algebraically closed, these groups 


we have 


are isomorphic to products J, /,. Similarly. the groups of the “tree” type. 


& 


do not yield new types of groups for r= s, for the matrix is then 


0 ars 
I ) 


and it is readily verified that the matrix W —( 


corresponding to matrices 


I —@r*\. 
0 I ) is such that 


so that the group is isomorphic to W, XI, (as was 


conjectured in [5. no. 10]). 
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Going now to the other extreme, let us consider the case in which 4 is 
the prime field F,, and let us investigate the classification of one-dimenstonal 
abelian groups over A. As o is then the identity, €*(K) is a commutative 
ring, and the problems amounts, according to Theorem 6, to classify elements 


of that ring under the equivalence relation 

(42) = —ta—c(n—tb) 

where ¢ is invertible. We have of course 


where the A, = (Ugo. 1." * *) ave Witt-vectors with components in F,, and 
similarly for b and ec. It is then easily verified that, given Bb, it is possible 
to determine ¢ in one and only one way, such that all the elements a;; = 0 
for «>0. The “first” elements a;,,. are then invariants of the class of 
abelian groups; as they can be chosen arbitrarily in F,, we see that the set 
of classes of non isomorphic abelian groups (isomorphism being meant in the 
sense of F,-isomorphism) has the power of the continuum (in sharp distine- 
tion to the corresponding situation in the case of an algebraically closed 
field K). We may also observe that in this case, the relation WoUW"' —=V 


would reduce to a=b instead of (42). 


20. Applications: II. Analytically simple abelian Lie groups. We 
now want to show that there exist abelian Lie groups of dimension > | 
which have no nontrivial subgroup, and can therefore be called “ analytically 
simple” groups. More precisely, we will exhibit a group @ of dimension 2 
such that there exists no non zero homomorphism of a group of dimension 1 
into G. G will be a group of the “dissvmetric braid” type, corresponding 
t- 


a We may suppose that A’ is algebraically closed, and 


to the matrix 
t 
then a one-dimensional group will correspond to a one-element matrix of 
type #”. The existence of a homomorphism of this last group into G means 
that there exist elements a. a’. b. Bb’ in €*(K). such that 


oe 


—F 
(43) ad) =(b ( ) 
which is equivalent to 
\ 
(44) 


k 


We write a= b= = where the 
i k 
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A,. A", B,. BY), are Witt vectors. It follows from the proof of Theorem 5 
(or by a direct argument from (41)) that we can suppose A; = (dxo, 0,0,°- 
A’, = (a’.9.0.° + +) 3 on the other hand, we can suppose that b and 6’ are 
not both divisible by a power of &, since otherwise a and @ would also be 
divisible by it. and we would then be reduced to the first case. We remark 
finally that we have g > 0, since there are no homomorphisms of the multi- 
plicative group into G which are not trivial, the hyperalgebra of G being a 


nilalgebra. 


1. Suppose By = (Doo. 001." #9. Identifying the coelficients of t° 
in the first equation (44) gives Ap Bo, hence doo = boo and b,; 
for i> 0. We cannot have ¢=1, for in that case, identifying the coefficients 
of in the second equation (44) would give #A’,— A’, = hence 
a’, =0. and A’, 0; but then, the same equation would give B’, = A’, = 0. 


and identifying the coefficients of # in the first equation (44) would give 


which contradicts 0. The identification of the coeffi- 
cients of @ in the first equation gives then (since g > 1), #4,—7B,— B’,™, 
and if B’, = - -), this yields As we have 
also a’,,=0. Now, if g==2. we obtain by identifving the terms in & in 


the second equation (44). 


and this vields by) = 0, contrary to assumption; if on the other hand q > 2. 


we obtain similarly 7.1’, = #%B’; — By, and we reach again a contradiction. 


Suppose now B’, 0; then we have in the same wav 1’, = B’,, hence 


ay, =’, ~0 and b’,,—0 for 1>0. We see as in 1. that we cannot have 


q=1: but if g>1, the relation gives hence 
again a contradiction. It is therefore impossible to satisfy (43) when a@ and 
a’ are not both 0. 
A similar argument shows that if G is associated to the matrix (" in 
\ 

with r<s. then G is analytically simple if s+ r is odd; but if s+r= 2h. 
there is a nontrivial homomorphism of the one-dimensional group corre- 
sponding to t' into G. These examples seem to show that the determination 


of all abelian analytically simple groups is a problem of great complexity. 


21. Applications: III. Duality of abelian Lie groups. Suppose that 
K is the prime field F,; then the ring €*(K) is commutative and the trans- 


posed matrix of any matrix with elements in that ring is again a matrix 


| 
| 
| 
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with elements in €*(K) (whereas in general it is a matrix over the opposite 
ring). If, to each square matrix U with elements in €*(K), we associate its 
transposed matrix tU, we are defining a pairing between the associated 
zroups. which we can call dual to each other. If the matrix W represents a 
homomorphism of G into G, then it follows from (41) that ‘W, represents a 
homomorphism of the dual G* of G into the dual G* of G; we will say that 
this homomorphism is transposed of the homomorphism represented by W; 
the usual rules concerning transposed homomorphisms are then valid. 
One-dimensional groups (over F,) are obviously isomorphic to their 
dual; so are two-dimensional groups of the “streak” or “braid” type; it is 
an open question whether an abelian Lie group is always isomorphic to its 


dual. 
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ON THE LOCAL BEHAVIOR OF SOLUTIONS OF NON. 
PARABOLIC PARTIAL DIFFERENTIAL EQUATIONS. 


III. Approximations by spherical harmonics.* 


By Puitip Hartman and AvuREL WINTNER. 


In this paper, the analogues of the theorems of |4]. [5] on solutions 
of elliptic partial differential equations will be obtained for the case where 
the number of independent variables exceeds 2. For the sake of notational 
simplicity, it will be assumed that the number of independent variables is 3. 
The equation to be considered is of the type Au + - - -=0, where Aw is the 
Kuclidean Laplacian of u, and no second order partial derivative of wu occurs 
in the rest of the equation. The replacement of Aw by a more general linear 
combination of second derivatives of u will not be considered at this time. 
(In the plane, this more general case can be reduced to the special case by 
conformal mappings under suitable smoothness assumptions on the coefficients ; 
in space, perturbation methods of Korn and Lichtenstein can be used.) For 
simplicity, the partial differential equation to be considered will be assumed 
to be linear (the methods are applicable to non-linear equations of the type 
(5)-(6) below). 

The first part of the paper deals with solutions near a zero. the second 
part with solutions near an isolated singularity. 


Part 1. The zeros of a solution. 


1. Statement of the theorems. In the linear elliptic partial differ- 
ential equation 


(1) 


let Aw denote the (Huclidean) Laplacian, Yu the gradient of u, a—a(.«) 
a vector and y=y(z) a scalar function of the position vector x, finally 
a‘ Vu the scalar product of a and Yu. It is known ([4], pp. 449-450) that 
if the number of independent variables in (1) is 2, say t= (2,22), if 


a(v) and y(x) are continuous in a vicinity of c=0 and if u=u(z) is a 
(*-solution (or even a C?-solution in an appropriate sense) vanishing at 


* Received April 18, 1955. 
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«=0, then either u(z) =0 or there exist an integer N >0 and constants 


(c;,¢2) (0.0) such that, as 


u(r) = cos Nd + sin Nd + 


(2) 
Vu(rz) = cos No + sin No) + 


uniformly in ¢, where «= (2,,2.) = (reos¢,rsing). The proof of this 
fact depended on an adaptation of a proof of a uniqueness theorem of 
Carleman [2]. 

The object of this chapter is to obtain similar asymptotic formulae in 
case the number of independent variables in (1) exceeds 2. The methods 
used to prove (2), depending on function theory, cannot be applied in this 
situation. They will be replaced by procedures depending on Fourier analysis 
and on a generalization of the argument used by C. Miiller [6] to obtain an 
extension of Carleman’s uniqueness theorem in the plane to a uniqueness 
theorem in space. 


The analogue of (2) will be 
u(r) =r%Sy(e) + 0(r%), 


V (x) ( rN Sy ( e) ) + 0 


(3) 


where Sy(e) is a spherical harmonic of order N. Here and in the sequel, 


«==re, where so that is a unit vector. 


THEOREM 1. Let x be a 3-dimensional vector, a=a(.r) a 3-dimen- 


stonal continuous vector and y=y(x) a continuous scalar function in « 
sphere about x=0. Let be a C'-solution of (1) on 
| 1 satisfying 


(4) u(0) 

Then either u(x) =0 on |x| <1 or there exist an integer N>O and a 
spherical harmonic Sy(e) 40 of order N on the unit sphere || =1 with 
the property that the relations (3) hold uniformly in e, as r='|r|—0. 


A function u=u(zx) is called a C'-solution of (1) on |r! <1 if it is 
of class C* and can be represented as the sum of an harmonic function and 
the Newtonian potential of (a: Yu + yu) /4r. 


Remark 1. It will be clear from the proof that Theorem 1 remains 
correct if the dimension number 3 is replaced by any dimension number. 


ob 
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Remark 2. It will also be clear that Theorem 1 remains true if (1) 1s 


replaced by a non-linear equation 


(5) Au + g(a,u, Vu) = 0. 
where g(.u. Yu) is a continuous function of its (seven) variables on a set 
|u| Sconst.. | Yul Sconst. and is subject to an inequality of 


the form 
(6) g(a, u, Yu)| S Const. (}u| +) 


Remark 3. The o(r')- and o(rX-')-terms in (3) can be replaced by 
O(rX* logr |) and | logr|), respectively. This improvement of (3) 
follows by repeating the arguments of Sections 6 and 7 below with (61) 
replaced by the estimates, u=O(r') and Yu=O(r'-'), supplied by (3). 
\ similar remark applies to Theorems 1 bis and 4 below and to the corre- 


sponding theorems of [4], [5]. 


As a consequence of Theorem 1, there results the following corollary 
which is an analogue (and extension) of Carleman’s theorem [2] for two 


independent variables: 


CoroLttary 1. Let a(x), y(x) and satisfy the condition of 


Theorem 1 and, in addition, let 
(7) u(r) =o(r") forn=—0,1,°: - 
asr—>0. Then u(x) =0 on |r| 

Another consequence of Theorem 1 is the following: 


CoroLLARY 2. Let a(a). y(x), satisfy the assumptions of Theorem 
1. Then the zeros of Yu(x) cannot cluster at r=0 unless u(r) =0 on 


In another direction. Theorem 1 can be extended as follows: 


THEOREM Ibis, Let a=a(r) and y=y(x) be, respectively, a vector 
and scalar function of class C1 in a vicinity of r=0. Let u=u(sr) ¥0 be 
a (*-solution of (1) in a vicinity of r=0 satisfying (4) (so that there 
ecist an integer N >0 and a spherical harmonic Sy(e) #40 satisfying (3)). 
Then the Hessian matrix of u. SAY Ur, Satisfies, as r—>0, 


Ure = (€)) + 


The details of the proof of this theorem will be omitted. It can be 


obtained by considering the inhomogeneous system of equations, 


Au; + a: V ux yur ° Vu yu = 


(k=1,2,8), 
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for the components of the gradient Yu = (uw, U2, us), obtained by differen- 
tiating (1), and modifying the proof of Theorem 1 in a manner analogous 
to that used in [4], pp. 467-469. 


2. An estimate for ||}u—u” ||. In what follows. x will denote a 3- 
dimensional vector. When it is necessary to deal with components of cr, 
these will be denoted by (2, y,z); there will be no confusion over this double 


use of x. The geographical coordinates of a point e on the unit sphere 


(8) B: 

will be denoted by (6.¢). Hence «= (rcos¢ sin 6,7 sin sin 6, 7 cos 8), where 
r=l|a|. A function w==u(«) can be considered as a function of r and e, 
say u=u(r,e). The symbol ||- - - || will refer to the Z?-norm on (8); thus 


|| w(r,e)|| becomes a function of r. The element of area on (8) will be 
denoted by de. 
Let 8,;(e) be the spherical harmonic 


(9 ) Snj Cal nj ( 6) COS io or j+n CnjP nj (8) sin 


for 7=0,1,---,n or j=1,: -,n, respectively, and n—0,1.: -. where 
the P,;(@) are the Legendre functions and the ¢,; are normalizing factors. 
Thus where -,2n and n=0,1.- -, is a complete orthonormal 
sequence on (8). 

Let a function u—=u(x) of «—re. where r—|2!|, be written as a 
function u(r,e). If, for a fixed r > 0, the functions =f(r.e) and 


u(x) —=u(r.e) are of class L? on (8). let 


(10) F,;(r) = f(r. e)S,;(e)do. (11) U,;(r) = u(r. e)S,(e)de 
e e 


denote the respective Fourier constants. Thus, u(r,e) has the Fourier series 


(12) u(r,e) SU yj (7) Snj(e) 5 
n=0 j=0 
u\(r,e) will denote the partial sum 
‘ N 2n 
(13) uN (re) SU 


n=0 j=0 


Let f(x) be a continuous function on |2!S1 and let u=u(sr) be a 
(-solution of the Poisson equation 


(14) Au =f. 


(2 


2n 
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The main equalities will furnish estimates, as r—>0, for U,;(r) and 
u(r,e) —wuN(r,e)||? in terms of 


(15) (r) =| f(r, e) 


LemMMA 1. Let f(x) be a continuous function on |a|S1 and let u(x) 
be a C'-solution of (14). Let d be a (non-negative) number and N an integer 
satisfying 
(16) N>dA+4. 

Then there exists a constant C (independent of N and 2) with the property 
that. for 


(17) |? =Cy(r), 
wh ere 
1 
(18) x (r) == Ne? — 2A s*(s)ds (So). 
0 


The constant C depends only on bounds for |w|, | on |x| = 1. 


3. Proof of Lemma 1. If u=—u(z2) is of class C*, then differentiations 
under the integral sign show that, by virtue of (14) and (10), the function 


(11) satisfies the singular inhomogeneous differential equation 
(19) (7°U")’—n(n+1)U where ’ = d/dr, 


U==U,; and The general solution U=U(r) of (19), continuous 


at is 
(20) U(r) = (2n + 1)7{r"(A — s'-"Ff'(s)ds) 
‘0 
Where .1=A,; is a constant. Since u is of class C*, it follows that the 


derivative 
(21) U nj (1) ur (1, €)Snj(e) do 
B 


exists, is continuous and can be obtained by a differentiation of (20): 
1 r 
(22) U(r) = (2n {nr"1(A — s'-"F'(s)ds) (n + 1)r-?" 
0 


The formulae (20) and (22), just derived under the assumption that 
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u=u(zx) is of class C*, are valid if w=u(z) is only of class C*. This 
follows, for example, by approximating f and wu by smooth functions. 
By letting r=1 in (20) and (22), 


(23) A=(n+1)U(1) + U’(1) (A = U =U yj). 


Hence, by Bessel’s inequality, 
co 2n 


(24) S const. (|| |]? + | ra 


n=0 j=0 


The relation (20) shows that 


1 A 
“0 


In view of Schwarz’s inequality, 


1 


r r 


1 
J ( 


if 2(n—A—1) > 1. that is, if n >A+ 3. Similarly, 


(f s"**F(s)ds)* ( qs) ( j *\F?(s)ds). 
0 ‘0 
The last three formula lines give 
1 
(26) 4(2n + 1)?U? S A?r?” + (2n — 2A— 3)? s*AF?(s)ds 
+ r?A+3(2n + 2A + f s*AF?(s)ds, 


if n>A+ 3. If this relation is divided by (2n7+1)* and the resulting 


inequality summed over n=N-+1,N+2,:-- and 7=0,-- -,2n, where 
N>A-+4, then Parseval’s relation shows that 


(27) |? =3 & Sr" nj-/(2n +1)? 
n=N+1 j=0 
+. Br?A+3(2N — 2X s-*@(s)ds. 


0 


In view of (24), this leads to (17)-(18). 


| 
r 
r r r 
| | 
(; 
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4, The derivatives u,, uw». Note that. in the deduction of (27) from 
(26), the factor (2n + 1)-* in the expressions leading to the last term in (27) 
was replaced by 1. Since (22) and (25) show that 


(.8) 4(2n S (m+ 1) 


where {- - -} is the expression on the right side of the inequality (25), the 


proof of Lemma 1 implies 


Lemma 2. Let u, f, N, A satisfy the conditions of Lemma 1. Then there 
erist a constant C (independent of N and 2») with the properties that. for 
7. 


(29) r? || up—ur |? SCx*(r), 


where 


1 
(30) _y*(r) = — + — 2a— 1) s*A@(s)ds. 
0 
In fact, the analogue of (27) is 
1 
r? || up-—u,* |? S3 rn A aj? + — 2A s-*@(s)ds, 
n=N+1 j=0 
0 


since (n+ 1)7/(2n+1)?S1. In view of (24), 


2n 
as no. 
j=0 
Clearly, (29)-(30) follows from the last two formula lines. 
A statement analogous to Lemma 2 will be obtained for the case where 
ru, is replaced by the partial derivative ug. 


LemMA 3. Let u, f, N, A satisfy the conditions of Lemma 1. Then 
there exists a constant C (independent of N and 2d) with the properties that, 
jor 0 <= r <I, 

(31) ll wp (7, — ug’ (r, e) ||? S Cy*(r). 


In order to see this, let V,; denote the Fourier constant 
(32) = {f ug (1, €) (e) do 
e B 
of us. Since ds—sin 6 d6d¢, a partial integration shows that 


(33) = — ff u(r, o(e) do. 


« 


233 3 ee 


| > 
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It is clear from (9) that Srj¢——JS8njn if 7=—0,1,:- 
= 1f G—n+1,---,2n. Thus 


(34) jUn j+n or V nj n) j-n 


according as O[jSn orn+15j752n. The factor or j/—n. which is 
majorized by n, is immaterial (as was the factor (n+ 1)* on the right of 
(28) ); so that the proofs of Lemmas 1 and 2 imply Lemmu 3. 


5. The principal lemmas. Let N be an integer and / a number (not 
necessarily an integer) subject to the inequalities 


(35) 2<NSI+4+2. 


The estimates to follow will be based on the assumptions that. as r—-0, 


(36) u(r,e) | (87) @(r) =o0(r). 
These hypotheses, Schwarz’s inequality, (10), (11) and (15) show that, as 
r—0, 

(38) | =0(r'), (39) Unj(r)| = 


Since the last part of (20) is 
Tr 


(40) s™?F'(s)ds =o(r'*) as r>0, 


0 
it follows from 1+2=2 WN, (39) and (20) that, as r-0, 
1 


(41) I=o(1) if n< N, where / =—/,;(r) = A — s?"F(s)ds. 


Thus, if n< N, 


r 
(42) (2n + 1)Uqj(r) (s)ds—r3 s"?F'(s)ds, 
‘0 

where the (conditional) convergence of the first integral is assured by (41). 

Schwarz’s inequality, as used in obtaining (26) from (25), shows that 

: 
(43) (2n (3—2n + ds, 
0 


if (42) holds (for example, if n< N) and if n<yu+3. Omitting the 


-,n and that 


l 

( 
ad 
( 
( 
t} 
( 
w 
in 
al 
(4 
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factor (2n + 1)?=1 on the left and summing over n= 0,1,- - N—1 and 
j=0,1,: > -,2n, it follows that, if N—1<p»-+ 3, 


r 


(44; ) |? << (5—2N + { (s) ds. 
Similarly, 
(445) r? |] |? (5—2N + (3) ds, 
r 
(44, ) S (5 — + 2p) s (s)ds. 
0 


For later reference, this result will be stated as a lemma: 


Lema 4. Let the conditions on u., f in Lemma 1 hold. In addition, 
let there exist an integer N and numbers l, » satisfying (35)-(37) and 


p+ 3 <l+2. 
Then (44, )-(44;) hold. 


It is clear that the convergence of the integral in (44) is imphed by 
(37) and the last part of (45). 
It will be convenient to have also the following results on wu: 


Lemma 5. Let u, f. land N satisfy the conditions of Lemma 4. Then, 


as r—>Q, 

(46) | tf «> 0. 

If, in addition, it is supposed that (35) is replaced by 
(47) 2=N<142, 

then (46) can be improved to 

(48) || Sy || —o(r’), 

where Sy =Sy(e) is some spherical harmonic of order N. 


Since V=/-+ 2, it follows that if n= N, then the expression J = I(r) 
in (41) is O(| logr|) =o(rt) as r—>0, for every «>0. Hence, by (20) 
and (40), Unj=o(r"*) if nN. In view of (44,), this proves (46). 
The last estimate for U,; can be improved to the statement that 


(49) lim =B,; exists if n= N, 
r>0 
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provided that limJ(r) exists (when nN). This is the case, for example, 
2N 

if (47) holds. Hence, if Sy(e) = 3 ByjSy;(e), then (48) follows from (44, ). 
j 0 


This verification of Lemma 5 and the arguments leading to Lemmas 2 


und 3 have the following consequences: 

Lemma 6. Let the assumplions of Lemma 4 hold. Then, as r-0, 
(50, ) || || if «> 0. 
If, in addition, (47) holds, then 
(50-) || — =o(r). 

Lemma 7. Let the assumptions of Lemma 4 hold. Then, as r—0, 
(51,) r || || tf «> 0. 
If, in addition, (47) holds, then 
(51.) r || — (r*Sy), || =o(r). 

6. || w|| and || Vul|. The lemmas just proved will be used to obtain 
estimates for || u|| and || Yu|j under the assumptions (35) and/or (47), 


(36) and (37). 
In view of (37), the integral in (18) is finite if 


(52) A+? <14+2 (that is, A<1+ 4). 


Hence, if (16) and (52) hold, then the function in (18) satisfies 


(53) x(r) = 
Then, by (17), 
(54) || as r 0. 


This estimate and (46) imply that, for every « > 0, 

when (16), (35)-(37) and (52) hold. If, in addition, (47) holds, then 
(55 bis) | || =o(r¥) + O(r*3), 


for some spherical harmonic Sy—Sy(e) of order N. 
If, in the arguments leading to (55), (S55bis), Lemmas 1 and 5 are 
replaced by Lemmas 3 and 6, one obtains 


(56) we || + O(r?), 
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for every « > 0, when (16), (35)-(37) and (52) hold. If, in addition, (47) 
holds, this can be improved to 
(56 bis) || up — (rVSx)¢@ || =o(r') + O(7?). 
Similarly, Lemmas 2 and 7 give 
(57) r|| uw, || =o + O(r2), 
for every « > 0, when (16), (35)-(37) and (52) hold. If, in addition, (47) 
holds, then (57) can be improved to 
(57 bis) r || uy — || + 
It will now be shown that if (16), (35)-(37) and (52) are satisfied, 
then, for every « > 0, 
(58) | Vu | + 
If (35) is strengthened to (47), this can be improved to 
| Vu— V | + 
To this end, note that 
(59, ) = + (592) Ug = — + 
and put 
(59s) Uy — Ugh. 
It is clear that uy satisfies the same type of inequalities, (56) and/or (56 bis), 
as ug. If the equations (59) are solved for u,, one obtains 


= — + hence r? | uz |? S| ru, |? + | wg |? + | wy |*, 


by Schwarz’s inequality. Thus (56), (57) and the analogue of (56) for uy 
show that r || wz || = + O(7*2) for every « > 0. A similar estimate 
holds for || u, || and || w, |). Thus (58) follows. The relation (58bis) is 


obtained in the same way. 


7. Proof of Theorem 1. The partial differential equation (1) can be 
written as (14), where 


(60) f=—(a‘VYu+yu). 


The assumption (4) and the fact that wu is of class C' imply that, for 
V = 1, there exists a spherical harmonic S$, (of order 1) satisfying (3). If 
S,(e) #40, then the proof of Theorem 1 is complete. If S,(e) =0. then, 
for N = 2, 


(61) u=o(rN-1), Vu=o(rN-?). 
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It will be shown that if (61) holds for some integer V(= 2), then there 
exists a spherical harmonic Sy(e) of order N such that (3) holds, where 


Sy(e) =0 is not excluded. 
In view of (60), the assumption (61) implies that (35)-(37), but not 
(47), are satisfied when 1 = N— 2. 


Thus, if A is any number satisfying 


(62) N=142>A43(>a+4+}), 


then the relations (55) and (58) are valid for every « >0. Let 0<e<1 
and A+3—N—e. Then (60), (55) and (58) show that (37) holds if 
N—1—e. 

Thus, the conditions for (55bis) and (58bis) are satisfied if 


(63) A+ 


ef. (16) and (47). Hence there exists a spherical harmonic Sy of order N 


= 


satisfying (55bis). Since « >0 can he chosen arbitrarily small in (63), 


this means that 
(64) u(r,e) —rNSy(e)|| =o(7%) as r> 0. 


Also, (58 bis) gives 


(64 bis) | Vulr.e) — V | =o(7%") as r 0. 


wt 


It will be shown that (61) and (64) imply (3). To this end, let 
(t= Gp(a,&) denote the Green kernel belonging to Laplace’s equation and 
the boundary condition w—0 on the sphere of radius R. Thus, 


(65) 4nG = 4rGp(a.é) = |é—v|1—R| &—2 


where &* = R°é/| €|? is the image of € under inversion of the sphere | é| = R. 
Let P=Pp(x,Re) denote the Poisson kernel belonging to the (surface of 
the) sphere of radius Ff; so that 


(66) P = Pp(r, Re) = (4rR)-1(R? — | x |*)|¢—Re 


Since u=u(z) is a solution of (14), where f(z) is given by (60), it is 
seen that, for r=|z| < R, 


(67) u(x) = + vR(2) and Yu(zr) = VARr(z) + Vvr(z), 


where Hpr(x) is the harmonic function 


(68) H,(x) = Re)u(Re) Rede 
e B 


164 

| 

| 

| 


APPROXIMATIONS BY SPHERICAL HARMONICS. 465 
and vg(«) is the particular solution 


(69) Up (ve) ——fff (a, é)f(E)dr 
of (14). In (69), dr is the element of volume in the sphere | | << 
Since rYSy(e) is an harmonic function, it follows that 


(70) Hp(x) = P(x, Re) {u( Re) — (Re) } 


If then | P(x, Re)R?|Sconst. Hence, by Schwarz’s inequality, 


| Hp(x) —rNSy(x/r)| S Const. || u(Re) —R%Sy(e)|| 
Consequently. (64) shows that, as R—-0, 
(71) —rNSy(a/r) =o(r’) uniformly for | «|< 4R. 
A similar argument shows that, as R—->0, 
(72) uniformly for | 


The assumption (61) shows that the function f(.r) in (60) is o(1\~*) 
as *7—>0. Hence, by (69), 


(73) | vp (x) | = 0(RY-*) fff | G(z, é)| dr. 


The inequality |é—a|"dr = 8rR*, where and (65) 
imply that, as R-0. 
(74) =0(R*) uniformly for | 


Thus, the first relation in (3) follows from (71), (74) and the first relation 
in (67). 


In order to obtain the asymptotic relation for Yu in (3), note that 
(75) =0(RN-) f VG(«,é)| dr; 
cf. the remark leading to (73). The inequality |é—a|-*drS 
and (65) imply that, as R-0, “\IKR 
(76) | Vvur(x)| uniformly for 42. 


Hence, the assertion in (3) concerning Yu follows from (67) and (72). 
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This completes the proof of the fact that (61) implies (3) for some 
Sy(e). Thus, in order to complete the proof of Theorem 1, it remains to be 
shown that if ws40, then it is impossible that 


(77) u=o(r"), Vu=o(r"*) for n=0,1,-- -. 


In other words, Corollary 1 remains to he verified (and it is clear, from the 
result just proved, that there is no loss of generality in strengthening the 
assumption (7) of Corollary 1 to (77)). 


8. Proof of Corollary 1. If / and N are any pair of numbers and if 
(77) holds, then (36) holds and (37) is a consequence of the definition 
(60) of f. Let A be the integer satisfying 
(78) A+4<N<A+ 3 


and let »=A; so that 2N—2A—3=1 and 5—2(N+1)+2421. Then 
the inequalities (17) in Lemma 1 and (44,) in Lemma 4, with NV —1 replaced 


; hy N in the latter, are applicable and give 
1 


0 


where C is a constant independent of N and A and depending only on the 


wi 


bounds of u and Yu. 


re The inequalities (29), (31) of Lemmas 2, 3 and the inequalities (44.), 
2 (44,) of Lemma 4. with N —1 replaced by N, imply that 
(80) i Vu C{N2r?N 2/(1—r?)* + (s) ds}, 
0 


by the argument involving (59). 


The definition of f in (60) and the relations (79), (80) show that, if a 
constant ¢ is suitably chosen (depending only on the bounds of u, Yu, a, y). 
then 

a 


(81) ®(r) Sc{ N?r?N-2/(1 — sr (s)ds}. 
0 
If this inequality is multiplied by 7-*’, a quadrature gives 
(s)ds S c{ N?(2N — 24 — 1) 4 f e-*\ (s) ds}, 


0 0 


166 
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if 0Srs(1—e)4. Thus, 2N—2A—1>2 implies that 


Tr 1 

(82) (1 — dcr?) f s°@(s)ds S c{ + f (s) ds}, 
0 

if OSrs (1—e)4. 

Note that if f(r) =0 in (14), (60), then u(x) =0, for otherwise wu is 
harmonic and satisfies (77). Hence u(z)==0 implies ®(7r) =0 and con- 
versely. 

It will be shown that (77) implies that w(2) =0 on every sphere |x| =r 
with a radius r satisfying 1—4cr?>0. Since c is constant, independent of 
V. A and depending only on the bounds of u, Yu, a, y on |r| <1, it will 
follow that u(r) =0 on | aj 1. 

Suppose. if possible, that u(z) 0 on |z|Sr. Then (7) 40 for 
some where r. Let be chosen so that r= (1—e)4. 
Then (82) is applicable. Since (7) > 0, it follows that the left side of 
(82) is minorized by const. where const. >0. But the right side is 
majorized by Const. (N?r?+ 1-*4). In view of (78) and 7 <7, this leads 
to a contradiction if Ao. 

This completes the proof of Corollary 1 and of Theorem 1 (hence of 
Corollary 2 also). 


Part 2. Solutions having a singularity. 


9. The nature of the singularity. This part of the paper will deal 
with functions w=u(a#) which are C?-solutions of (1) on a punctured 
vicinity of 0, say on 
(83) 


The theorems to follow will consider the three cases in which the solution 


u==u(zx) satisfies one of the following conditions, as r—|2|—0: 

(84) =o(1/r), 

(85) u(r) 

(86) u(x) =o(r--?), Vu(z) =o(r*~*) for some integer V = 0. 


The theorem concerning (84) is a theorem on “removable singularities ” : 


THEOREM 2. Let x, y(x), a(x) satisfy the conditions of Theorem 1. 
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Let u(x) be a C'-solution of (1) on (88) satisfying (84) as r>0. Then 


u(0) can be defined in such a way that u(x) becomes of class C? on | x a1. e 
It will be clear from the proof that the analogue of this theorem is true , 
in the plane. This two-dimensional analogue is an improvement of what d 
follows from the results of [4] on removable singularities, inasmuch as there 
is no condition on Yu as #0 in Theorem 2. Following Bécher [1], the 
theorem on removable singularities is obtained as a consequence of theorems F 
on the existence and uniqueness of a solution having a Green singularity at ‘ 
a given point. 
If (84) is replaced by (85), Theorem 2 is altered to 
THEOREM 3. Let a, y(x) satisfy the conditions of Theorem 1. Let u(.) , 
be a C'-solution of (1) on (83) satisfying (85), as r>0. Then there exists 
a constant c > 0 such that, as r—-0. 
(87) u(x) =c/r+O0(|logr|). Vu(r)=V(e/r) + 0(1/r). 
: 30cher [1] has proved theorems similar to Theorems 2 and 3. In his ( 
; analogue of Theorem 3, the assertion (87) is considerably improved. Lécher . 
° assumes that the coefficient functions a, y are analytic, but his proof does 
: not use the full foree of this assumption. It requires merely that (1) has ( 
> an adjoint and that both (1) and its adjoint possess Green functions (for | 
> some sphere |x| FR). 
Z The proof of Theorem 3 will be similar to the proof of its analogue in (: 
- the plane; cf. [5], pp. 351-352. Note that the assertion of the two-dimen- n 
2 sional theorem (*) in [5] is to the effect that Yu=—cYV (logr) + 0(1/r), 
3 but the proof given there shows that this can be improved to 
Vu=—cV (logr) + O(| logr |). 
= ef. [5], p. 358. 
Theorem 3 is a uniqueness theorem on Green’s singularity. There is an 2 
existence theorem analogous to the (two-dimensional) theorem (**) in [5], ( 
pp. 360-361. 
Theorem 2 will depend on Theorem 3, the existence theorem just men- is 
tioned and the methods of Part 1 above. ({ 
THEOREM 4. Let xv, y(r), a(x) satisfy the conditions of Theorem 1. T 
Let u(x) be a C'-solution of (1) on (83) satisfying (86), as r>0. Then (3 
there exists a spherical harmonic Sy(e) of order N (possibly Sy(e) =0) 
satisfying, as r—>0, ar 
& 


uniformly in e. 
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The proofs of Theorems 2, 3 and 4 will be applicable if the linear 
equation (1) is replaced by a non-linear equation (5) for which (6) holds. 
Although the notation of vector products is used in the proof of Theorem 3, 
it will be clear that this does not affect the validity of the method if the 


dimension number 3 is increased. 


10. Proof of Theorem 4. he proof is similar to that of Theorem 1 
and will only be indicated. Since U=U,; need not be continuous at the 


origin, (20) must be replaced by 


1 1 
(88) (2n4-1)U=r"(A— (s)ds) —rb"(B— ( s'™?F'(s)ds), 
where 4 = A,;. B=B,; are integration constants to be determined. Note 


that the first part of (86) implies that 
(89) U =o(r'*) as 0. 


The assumptions (86) imply that f in (14), (60) satisfies 


(90) as r 0, 
hence 
(91) & = and F = Fyj = 


This estimate of #’ shows that 
1 


(92) rn if for n=0,1,- - - 
In fact, the expression on the left is o(|logr|) or o(7*-*) according as 
(N.n) is or is not (—1,0). 
In view of (89) and (92). the coefficient of r-*-" in (88) is o(1), as 
r—>0(0. if n > N; so that 


(93) (88) reduces to (20) if n >N. 


The (absolute) convergence of the integral which is the factor of r-'-" in 
(20) is assured by the last part of (91). 

It follows that Lemma 1 remains valid if the conditions A= 0 and (16) 
are replaced by 
(94) 


= 
— 
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and (18) is replaced by 
1 


(95) 4 4+ 2A + { ds. 


0 


Corresponding analogues of Lemmas 2 and 3 hold. 


In order to examine the analogues of the lemmas of Section 5. assume 


that 
(96) where N + 2. 


This corresponds to (37); the analogue, || u(7,e) || =o(r-*), of (35) 


is a consequence of (86). It follows that 


1 

s™2F'(s)ds is or | logr!) 

3 e 

, 2 

according as n </—2 or n=I1—?2. Also 

j 1 
lim (B— exists if n >]—?. 
7 Thus, (96) implies that, as r— 0, 
, = r 

> || if N&O and «> 0, 

a and this can be improved to 

3 

- where Sy is some spherical harmonic of order NV. These assertions correspond 
_ to Lemma 5. It is clear that assertions analogous to Lemmas 6 and 7 hold. 


Thus. the arguments in the first part of Section 7 imply the existence 


of a spherical harmonic Sy of order N satisfying, as r—> 0, 
(97) wr) — rb Yule) — V(r = 


if VN=0. The proof of Theorem 4 can now be concluded by a modification 
of the arguments of the second part of Section 7. The Green kernel (65) 
of the sphere «|= RF must be replaced by the Green kernel of a region of 
the type R/4=!2|=4R, and the Poisson kernel (66) by the kernel used 
to determine an harmonic function on R/4 < |r! <4R when its vaiues on 
the two concentric spheres | z| = R/4 and | «| =4R are given; cf.. e.g.. [3]. 
The analogues of (71), (72). (74). (76). when iw! SSR is replaced by 


AR<=/ <2R. can then be deduced. 


> 
SC 

| 

| 
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11. Proof of Theorem 3. The proof will be similar to the proof for 
the analogous theorem in the plane; cf. [5]. Assume first that 


(98) 

in (1). Then. by Theorem 1, the zeros of the gradient of wu can cluster only 
at a= 0. since (98) shows that Corollary 2 of Theorem 1 is applicable to 
u(x) —u(a)) at any point =z, of (83). It follows, by arguing as in [5], 
p. 355. that there exist arbitrarily large constants C such that the locus 
(99) L=Ie: u(x) =C 

is a closed (Jordan) C?-surface surrounding «<—0. Furthermore, 


(100) on L, 


where » is the inward unit normal vector to L (and wu, is, therefore, the 
corresponding normal derivative of w). 

If 7 is a suitable region bounded by a surface S and p is the unit 
outward normal on S, finally v is a suitable scalar function, then Green’s 
identity is 


(101) = {ff (vAu+ Vr: Vu)dT. 
Jus 


S 


Also, if w is a smooth veetor function, Green’s identity gives 
e T e 


where X denotes vector multiplication. If the second of these relations is 


subtracted from the first, one obtains 


e 
S 


(102) ( Vu: (vp 4+ (wX p) )\dS = if vAudT, 
e e 


provided that 
Xv 


For a given v, the last equation has a solution w if and only if v is harmonic. 
[f wis a C’-solution of a Poisson equation (14) and if S and w are sufficiently 
smooth, then (102) remains valid. 

Let € be a 3-dimensional vector and let «—é€ have the components 
(¥.¥.Z). Choose 


(103) v= X/p*, where p=|«—é | 


= 


we 
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(so that rv is the first component of —Y(1/p)). A vector w satisfving the 


equation following (102) is 
(104) w= (0, Z/p*. — Y/p*). 


Apply the relation (102) to the region T bounded by the sphere B: 
1, the locus (99) and a small sphere «—é€| =e about a fixed point r 


(between B and L). On the sphere |«—é€} =e, the outward unit normal p 
is (c—é&)/e. Hence, vp + (w X p) = (1/e,0.0), and so 


lim (ep + (we p)) dS = 4ru, (2) 
0 


(here u,—du/or is the first component of Yu). Also. note that 
Vu: (w Xp) =0 on L. Hence (102)-(104) lead to 


(105,.) 4, (ar) 


where 7’ —7'(C’) is the region bounded by B and L = L¢. 
The relations (105,), the corresponding relations (105,), (105,) for 


Uy(x), and Green’s identity 


(106) (was — f updo + Aud? 


are the exact analogues of formulae (22), (28) used in [5] for the proof of 
the planar case of Theorem 3 (when (98) is assumed). A modification of 
the arguments of [5]. pp. 356-357, leads to the result that 


(107) = lim { UpdS exists 


Coxe 
and to the asvmptotic relation 
(108) Vu=cV(1/r) + 0(1/r) as r> 0. 


The relation (108) and a quadrature of (108) give Theorem 3, in the 
case (98). Finally, the general case can be reduced to (98) by a variation 


of constants; cf. [5|, pp. 359-360. 


12. Proof of Theorem 2. The existence theorem mentioned after 
Theorem 3 is to the effect that if R>O is sufficiently small. then (1) 


| — 
| 
— 
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G(r) on O< SR satisfying G(r) as 0. 


has a C}-solution u 
By Theorem 3, 
=V(e/r) + O(1/r), 


Gi(w) =c/r+ O(| logr |), 


as 0. where c > 0. 
If (84) holds, then u(x) + G(x) isa Ct-solution of (1) SR 
It follows from Theorem 3. (84) 


satisfying + as 0. 


and the last formula line that 
(109) u(x) =O(|logr |), Vu(x) =O(1/r), 


Thus 


and F 


(110) U=U,;=O(|logr|), (111) 
One of the terms on the right of (88) is 


1 
s’"F(s)ds=O(1), O(r|logr|) or O(r) 


(112) 
In view of (110), the coefficient of 


n> 


lence 


according as n=1 
in (88) is 0(1) as 0. 
1 r 


f s"?F(s)ds = s"? ds, 


e e 
0 
Thus 


where the absolute convergence of the last integral is assured by (111). 


(88) reduces to (20) if n=O. 
Analogues of the Lemmas 1-3 give == O(r? + r***) if 


The case n=O of (20) and the estimate for F in (111) show that 


u—uU" | 


= Const. + O(r). Henee || u— Const. || = O(7) as r> 0. 
The proof of Theorem 2 can now be completed along the lines used to 

prove Theorem 1. In faet, the proof of 

u(r.e) — Const. = O(1) 


: ef. the 


is identical with the proof of the first formula of (3) in Theorem 1 
The analogue of (72), namely, YVHr(«) =O(1) 
obtain the analogue 


= $R. is clearly valid. In order to 


replace (75) by 
over 


derivation of 
uniformly for | 2 

of (76), namely, YVer(2) =O(1) uniformly for 
=J,+/,. where /,, /, are the integrals of 


Vu 


Vir(s) 


— 
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the domains |€é| = R/4 and =R, and let Then, since 


Vul|=O(1/r), by (1) and (109). 


fff 


Consequently, =O(1) as Thus f(x) in (14). (60) 


is bounded. From this, it follows that u(x) is of class C? (at «=U also). 


This proves Theorem 2. 


THE JOHNS HopkKINS UNIVERSITY. 
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ON THE ASSIGNMENT OF ASYMPTOTIC VALUES FOR THE 
SOLUTIONS OF LINEAR DIFFERENTIAL EQUATIONS 
OF SECOND ORDER.* 


By HarTMAN and AUREL WINTNER. 


Let both coefficients of the homogeneous, linear differential equations 


(1) 2’ + +f(t)e=0 
be given as continuous functions for large positive J. 

Suppose that, corresponding to every given value, c, the differential 
equation (1) has a unique solution «= .2r(t) satisfying 
(2) 
as L—>»o0o (so that x(o) can be assigned as an “initial condition ~ and this 
single integration constant determines the solution x(t) uniquely. even though 
(1) is of second order). Then (1) will be called of type (*). Clearly, (1) 
is of type (*) if and only if (1) has exactly one solution satisfying the case 
c==1 of (2) and, in addition, the case ¢ 0 of (2) is satisfied only by the 
trivial solution (==0). It is also clear that, since the general solution of (1! 
contains two arbitrary constants, r(o) cannot exist (as a finite limit) for 
all solutions x(¢), if (1) is of type (*). 

If 2(o) exists (as a finite limit) for all solutions and if x(0) ~0 
holds for some solution, then (1) will be called of type (**). Clearly, this 
will be the case if and only if there exist a solution satisfying the case c= 1 
of (2) and a non-trivial solution ($40) satisfying the case c—0 of (2). 
by adding to the former any constant multiple of the latter, it is seen that 
(1) is of type (**) if and only if there belongs to some and/or every c+0 
two, linearly independent, solutions x(¢) satisfying (2). 

The following theorem will first be proved: 


(1) In order that (1) be of type (*), tt is sufficient that 


(i) the coefficient functions of (1) satisfy the conditions 


(3,) fi f(t)| dt<o, (32) g(t) =0. 
and it is also sufficient that 


* Received January 3, 1955. 
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(ii) condition (3,) of (i) be reduced, but condition (3.) of (i) be 
strengthened, as follows: 


(4,) (4:) g(t) Sconst. < 0. 

Condition (3,) of (i) cannot be relaxed to (4,) if (4.) is relaxed to (3.), 
as in (i). Otherwise it would follow that (1) is of type (*) whenever (4,) 
holds and g(t) =0. But this is not true. In fact, it is known that if f(/) 
is real-valued and does not change sign, then (3,) is not only sufficient 
(Bécher) but necessary (Weyl) as well in order that the case g(¢) = 0 of (1) 
be of type (*) (for a simple proof, cf. [6]). 

It will be clear from the proof of part (i) of (1) that the assumption 
(3.) of (i) can be generalized to 


(5) g*(t)di where g* =max(0,g); 


so that (3,) and (5) (hence either (3,) and f ig(t)|di<« or (3,) and 


(3.)) are sufficient in order that (1) be of type (*). Note that g (but not f) 
is here assumed to be real-valued; cf. the remark preceding the proof of 
(1) below. 

The proof will show that, under the conditions of (1), it is possible to 
assert much more than the fact that (1) is of type (*). As will be seen 
from the proof of (1), condition (3) or (4) implies that (1) has a pair of 
linearly independent solutions «= .w2,(t), satisfying, as 


ry(t)—1, 2x,’=0(1/s); a(t)~s, 


where » p(t) and s=s(t) are defined by (10) and (11) below. 
A simple illustration of part (ii) of (1) is the following known fact: 
Whenever (4;) is satisfied. 


(6) —{1+f(t)}z—m0 


has a (unique) solution z(¢) which is asymptotically equal to e-'. that is. for 
which the function x(t) =e'z(t) satisfies the case c—1 of (2). In order 
to see this, it is sufficient to observe that substitution of z= e-tr into (6) 
leads to the case g(t) ==—2 of (1); so that part (ii) of (1) becomes 
applicable. 

A criterion which belongs to property (**) in the same way as part (ii) 


of (I) belongs to property (*) is as follows: 


“ 
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(11) Jn order that (1) be of type (**), it is sufficient that the coefficient 
functions of (1) satisfy the conditions 


bast if f(t)dt <<, (72) Re g(t) = const. > 0. 


Note that, while (7,) is identical with (4,), condition (7.) is a dual of 
condition (42), if g(t) is real-valued (which is not assumed in (II)). Since 
(II) corresponds to part (ii) of (1), it is worth mentioning that what would 
correspond to part (i) of (1) as a (**)-criterion is false. In other words. 
(3,) and g(t) =0 together fail to assure that (1) is of tvpe (**). This is 
seen by choosing f(t) =0 and g(t) =0. 

It should also be remarked that (I) becomes false if g(/) is allowed 
to be complex-valued and g(t) is replaced by Reg(¢) in (32) and/or (4.). 
In order to see this, consider the case f(t) ==0 of (1) (so that f satisfies 
(3,)). The general solution of the corresponding differential equation (1) 
ix ¢, + c.s(1), where c,;, ¢. are arbitrary constants and s(t) is defined by 
(10), (11) below. Thus the desired example results if Reg(t) =—1 and 
s(%) =lims(?) exists as a finite limit (for, in this case, (1) is of type 
(**). not (*)). If g(t) =— (1+ 7fe*). then 


u 
> 


s(t) = and p(u) ve’ dv. 


Define w—w(u) = j ve’ dv, so that e“du=dw/u. The change of integra- 
tion variables u—>w gives for s(¢) the formula 


w(t) 


s(t) = (e'”/u(w) )dw, 


Where u=u(w) is the inverse of w—=w(u). It is clear that w(t) 2 as 
¢—>co and that 1/u(w) tends monotonously to 0 as wo. Hence s(o) 
exists (as a finite limit). Thus the case f=0, g=—(1-+itet) of (1) 


is not of class (*). 
Proof of (1). It is known that, if h(s) is defined and continuous for 
large positive s, then the condition 


(8) f s|h(s)|ds 
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(and even a somewhat weaker assumption; cf. [5], [8]) is sufficient in order 
that the differential equation 


(9) ¢+h(s)x—0, 


where dz/ds, has two solutions, say and satistying 
@(s)~s (and ¢,—0(1/s), as so. Hence (8) is 
sufficient in order that (9) be of type (*). Both assertions of (I) will be 
reduced to this fact. 

First, whether (42) or just (3.2) is assumed, it is clear that p—yp(/), 


where 
t 


(10) p= exp — g(u) du, 


is a positive function, hence s s(t), where 

(11) ds = pdt, 

is an increasing function, and that so as t->0 (the choice of the integra- 
tion constants in (10) and (11) will be immaterial). On the other hand, if 
the independent variable ¢ is replaced by s in (1), it is readily verified from 
(10) and (11) that (1), where 2’ = dz /dt, goes over into (9), where «= d/ds 
and 

(12) h = f/p?. 


Since (8) implies that (9) is of type (*), it follows that (1) will be proved 
to be of type (*) if it is shown that, whether (3,) and (3:) or (4,) and (4.) 
are assumed, the function (12) of s will satisfy (8). Since s =~ corresponds 
to too, this requires that 


/p?(t)ds(t) 


or, What in view of (11) and (10) is the same thing, that 


t 


Ad (i). It is clear from (3.) and (10) that »(¢) increases with ¢. This 
implies that the indefinite integral of »(/) is majorized by a constant multiple 
of tu(t). Hence (13) follows from (3,). 


Ad (ii). If only (4;) is assumed, (13) will hold if the quotient 


t 
(14) 
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is bounded as £00. On the other hand, in view of an applicable “ O-variant ” 
of V’Hépital’s rule, (14) will be bounded if the quotient p(¢)/p’(t) is. 
But (10) shows that the latter quotient is identical with the function — 1/g9(t). 


Finally, the boundedness of this function is assured by (42). 
Proof of (11). It is known ([4], pp. 55-56) that if p(1) #0 and q(1) 
are, for large positive ¢, continuous functions satisfying the pair of conditions 


(15:) f | p(t) | dt (15.) | q(t) ( | p(u)|-*du)dt 
e e 
then the differential equation 


(16) (p(t)2”)’ + q(t)x=0 


has a pair of linearly independent solutions, say «—.2#,(¢) and «—v.(t), 
satisfying and x.(t) as to. This means that(15,) and 
(15.), where p(t) 40, imply that (16) is of type (**). 
On the other hand, if (1) is multiplied by the (non-vanishing) function 
e/(), where 
t 


(17) jt) = f g(u)du, 
then (1) appears in the form (16), where 


(18,) q(t) = f(t) ei. 


Hence (11) will be proved if it is shown that (7,) and (72) imply (15,) and 
(15.) by virtue of (17)-(18.); in other words, that 


(19,) { | | di oo, 
(19.) | f(t) | ( f | | du) dt 
t 


If c>0 denotes the constant occurring in (72), it is clear from (17) 
that | e/ | is majorized by a constant multiple of e°t, which implies that 
(19,) is satisfied. But it is also seen from (7,) and (17) that 


| duxe |Reg(u) | | duset| |, 


t 


= 
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is. 
A theorem of somewhat different nature will now be considered. 
(111) In the differential equation 

(20) + f(t)z=0, 


let f(t), where OS t be a complex-valued, continuous function having 
a non-positive real part. Then (20) has a solution z(t) 40 corresponding 
lo which |2(t)|* is a non-increasing, conver function on OSt<oo. In 
particular, the limit of | z(t)| as to exists as a finite limit (20). Any 


solution z= w(t) of (20) linearly independent of this solution z(t) 1s subject 


lo the condition | w(t)|—o as 


If f(t) is real-valued, then (III) reduces to a well known theorem of 
A. Kneser. In order to prove (IIT), let z= -+ ty and f(t) =a(t) 4+ 16(1). 


Then (20) can be written as a real system, 


+ F(t) (*) —(). where 
y b al 


Since a0 by assumption, the symmetric part of the matrix F is non-positive 
definite. Hence (IIT) can be concluded from a result of [5]. 


Appendix. 


When confined to the real field, both (1) and (II) refer to cases 
in which (1) is non-oscillatory (cf. below). This Appendix deals with the 
general class of non-oscillatory differential equations (1) under the assumption 
that the,a’-term is absent from (1). 
For large positive ¢. let f(t) be a real-valued, continuous function. and 
consider only those solutions 2(¢) of the differential equation 


(1) x’ +f(t)r=0 


which are real-valued and distinct from the trivial solution (#0). Thus. 
since the values of « and 2’ at a given ¢ determine a solution of (1) uniquely. 
x and 2 cannot vanish simultaneously, and so z(t) must change sign wherever 
it vanishes. If there exists a particular solution z(¢) which does not vanish 
for large ¢t, say for fj) St<o, then (1) is called non-oscillatory, and other- 


wise oscillatory. This classification of the differential equations (1) is inde- 


Hence is satisfied if f | f(t)ei | ot | eI | dt is, that is, if 


th 


Wi 
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pendent of the choice of the particular solution .(¢), since, in view of Sturm’s 
separation theorem, every solution of (1) must have zeros clustering at {= 
if one solution has such zeros. 

If (1) is non-oscillatory, then, for reasons which will be apparent from 
the results to be proved below, a solution «(¢) of (1) will be called principal 
or non-principal according as the (continuous) function 1/7(¢) is not or is 
of class L? on the half-line |¢), 0) when ¢, is large enough (namely, larger 
than the last zero of x(t)). 

If x(t) is a non-principal solution, that is, if the second factor of the 


product 
(2) x(t) f ds 


can be formed (for /=/,), then two differeniations of the product show that 
the function (2) is a solution of (1), since 7(¢) is. These two solutions are 
linearly independent, since their ratio is not a constant. Actually, the solu- 
tion (2) can be obtained by solving for y—y(t) the case c——1 of the 
differential relation (3) below. 

Let r—-sr(/) and r=y/(t) be any two solutions of (1). Then their 
Wronskian is a constant, 


(3) ry’ — ya’ =e, 


and the constant ¢ is distinct from 0 if and only if the two solutions are 
linearly independent. Suppose that they are, that (1) is non-oscillatory, and 


that ¢ is large enough. Then (3) can be written, on the one hand, as 
(4) (log y)’— (log r)’ =c/(.ry), where c0, 

and, on the other hand, as 

(5) (y/x)’ =c/x*?, where 


With the aid of these formulations of (3) (and of its formulation (2), 
specified above), the following theorem will be proved: 
THEOREM. /f (1) ts non-oscillatory, then it has a principal solution, and 


the latter is unique to an arbitrary constant factor (~0). 


This contains the following corollary (which corollary of the theorem 
will, however, be needed in the proof of the theorem) : 


CoroLiary. If (1) ts non-oscillatory, then one (at least) of any two 
of its linearly independent solutions is non-principal. 


+ 
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This, in turn, implies the existence of a non-principal solution. .\ proof 
for the existence of such a solution is between the lines of [1], p. 703 (and 
the properties of non-principal solutions are further analyzed in [2], p. 633). 
The following proof of the Corollary is a simplification of the proof just 
mentioned. 

Let x(t) and y(t) be the two solutions referred to in the Corollary. 
Then, since (5) holds for large ¢, the function y(t) /xz(t) is ultimately mono- 
tone. Hence 


(6) y(t)/xr(t) oC, as ton, 


holds for some limit C’, provided that C =o and C =—wo are allowed. The 
latter two possibilities can, however, be reduced to the case C = 0, since x(t) 
and y(t) can he interchanged. But if (6) holds for some finite C. then it is 
seen from (5) that an indefinite integral of the square of 1/z(¢) must tend 
to a finite limit as f—>00. Since this means that x(¢) is non-principal, the 
assertion of the Corollary follows. 

Besides the Corollary, part of the following Lemma will be needed in 


the proof of the Theorem. 


LemMA. Jf (1) ts non-oscillatory and if x(t) is a fixed non-principal 
solution of (1), then every solution of (1) 1s of the form 


(7) Ca(t) + 0(| x(#)]). as 


where the constant C is or is not 0 according as the solution (7) is principal 


or non-principal. 


In order to prove this, suppose that y(¢) is a non-principal solution. 
Then, since «(#) is supposed to be non-principal, and since the product of 
two functions of class L? is of class L. it follows from (4) that log y—log sx 
tends to a finite limit as /—>0,. which means that (6) holds for a finite 
C0. This proves that assertion of the Lemma which concerns the case 
CA0 of (7). In the remaining case. the assertion of (7%) is that every 
principal solution must be o(| x(t)!) if #(¢) is non-principal. 

If x(¢) is non-principal, then the solution (2) exists (for large 4) and 
is of the form x(t)o(1), which is o(!| 2(t)|). Hence. by that part of the 
Lemma which has already been proved, the solution (2) cannot be non- 
principal. Consequently. the proof of the Temma as well as the proof of 
the Theorem will be complete if it is assured that there cannot exist two 
linearly independent solutions both of which are principal. But this is assured 


by the Corollary. 
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Remark. (Added 4.8.1955). The notion of a principal solution has a 
curious application to the general case of (1), the case in which f(#) is 
real-valued and continuous for large positive ¢ but (1) is not required to be 
non-oscillatory. 

First, it is easily verified from (3), (1) and the case cy of (1) that 
y= (z?-+ y*)? > 0 is a solution of 


(8) + g(t)r=0, 


where g(t) =f (t) —c?/r*(t), c? > 0. Since r= r(t) is positive throughout. 
(8) is non-oscillatory and has therefore both principal and non-principal solu- 
tions. To which of these two types will the particular solution r= (.c? + y?)2 
belong? The answer turns out to be as follows: r= (r*-+ y?)* is a non- 
principal or principal solution of (8) according as (1) is not or is non- 
oscillatory. 

In order to see this, consider the complex-valued solution z=. -+ ty of 
(1). Then z0 and 20, by (3), where c 0, and it is also seen that 
the imaginary part of 2’/z is c/r?, where r=|z|. Hence (arg z)’ =c/r?. 
This implies that arg (« + iy) is a strictly monotone function of ¢ and that 
| r(t) is of class L? if and only if arg (x-+ iy) is bounded as {—>x. Since 
the latter condition is satisfied if and only if « (and/or y) fails to acquire 
an infinity of zeros as t—>0o, it follows that 1/r(t) is of class L? if and only 
if (1) is non-oscillatory. This proves the last italicized statement. 
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MAJORANTS IN SPACES OF INTEGRABLE FUNCTIONS.* ' 


By G. G. LoRENTZ. 


1. Introduction. Let f(x) denote an integrable function on the interval 
(0.1). The Hardy-Littlewood majorant of f(z) is the function 


ay 


provided it exists almost everywhere (a.e.). Hardy and Littlewood have 
shown (see for example [5], p. 244) that if fe L?, then @(f) is defined and 
also belongs to L?. More generally, if X is a certain space of integrable 
functions over (0,1), we shall say that XY has the Hardy-Littlewood property 
(or shortly that Xe HLP) if feX always implies 6(f)eX. The most 
general spaces Y which we shall consider are the Kothe-Toeplitz spaces X (C) 
defined as follows (compare [3]). Let C be some class of integrable non- 
negative functions c(z) on (0,1), which contains a function greater than 
2 positive constant. A function f(x) belongs to X(C) and has norm | f |! if 


(2) | f f(a) | < fo. 


PL e 


V(C) is a Banach space [3]; its dual Y’ is the space of all integrable func- 


tions g such that . fg dx exists for all fe Y. In this case the suprema 
a1 
of fg dx for all f with || and of | f\g|dx for all f20 with 
0 
| f || S1 are equal and finite, and by definition equal to the norm of g in X”’. 
Thus our “dual space of XY” is different from “conjugate space of Y” in 
Banach’s terminology (i.e. the space of all bounded linear functionals on X). 
The use of dual spaces rather than conjugate spaces is justified by the fact 
that the former are simpler and that the second dual of X may be X even 
if Vis not reflexive. Actually, the relation \’’ = Y holds for any Kéthe- 
Toeplitz space. 
The importance of Kéthe-Toeplitz spaces is underlined by the fact that 
they are identical (as has been shown by the author in a paper under prepara- 
* Received December 2, 1954. 
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tion for publication) to an apparently more general class of spaces ([2], p. 415) 
described in terms of lattice operations in XY and the norm ||f || only, without 
reference to the class C= {c}. Essentially identical with these are spaces 
discussed by Ellis and Halperin [6], p. 576. 

Our problem will be to give necessary and sufficient conditions for C in 
order that a Kothe-Toeplitz space X(C) should have the Hardy-Littlewood 
property. 

We shall assume that C is rearrangement-invariant, that is, with each 
c(z) any of its rearrangements [3] c,(x) also belong to C. (Then X(C) 
will also be rearrangement-invariant.) In this case the answer is easily 
given (see Section 2): for each ceC, the function 


(3) v(t) /tdt 
should belong to X’. 


Special cases of spaces X¥(C) with rearrangement-invariant C are spaces 
A(¢), M(¢) [3] and Orlicz spaces Le (for an exposition of the theory of 
Orlicz spaces see [4]). Here the condition just given may be simplified so as to 
bear directly on ¢ or &. This is carried out in Sections 3-5. Spaces A(¢, p) 
and M(¢,p) (see [3]), with p>1, are not very interesting in our problem, 
since the original argument of Hardy and Littlewood is sufficient to show 
({1], [2]) that they have the Hardy-Littlewood property. 

The importance of the Hardy-Littlewood property lies in the fact 
that it is equivalent to the dominated convergence a.e. of the quotients 
fr(z) =h {FP (x-+h)—F(x)}, where F(x) is the indefinite integral of 
f(x), to the latter function, whenever f(z) lies in X, the common majorant 
for the f, being 0(x;f). In lattice-theoretic terms, this dominated conver- 
gence may be shown to be equivalent to the order convergence of f, to f in 
the Banach lattice XY endowed with its natural partial order. The dominated 
convergence of f, to f implies dominated convergence to f of many types of 
singular integrals; for example, the Fejér and Poisson integrals (compare 
[5], p. 247). These facts justify our interest in the Hardy-Littlewood 
property. 


2. Hardy-Littlewood property of the spaces X(C). For a positive 
and decreasing function f(x), the majorant 6(z;f) is obviously equal to the 
function 


(4) (2) 
70 
It is therefore natural to consider the properties of a space X(C), which 
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we call HLP’ and HLP”, defined by the requirements that fe X(C) for all 
feX(C) or that fe X(C) for all positive decreasing f in X(C). 
Since f(x) is contained between 6(2;f) and 


—6(z;—f) —int f(t)dt/(y—2), 


it is clear that HLP implies HLP’, and this in turn implies HLP”. 


Lemma. If C is rearrangement-invariant, then all properties HLP, 
HLP’. HLP” are equivalent, and each implies 


with some constant A. 


Proof. If HLP” holds and fe X(C), then f*, the decreasing rearrange- 
ment of |f |, also belongs to X(C), and hence f* =0(f*)eX(C). By [2], 
pages 420-421, 6*(x2;f)< 26(z;f*). (This means that 


0 
for all x=0.) Therefore 
1 al al 
0 0 


so that 6(f) eX(C), and we have established HLP. Also, if (6) is satisfied 
for all positive decreasing f, then (5) must hold. It will now be sufficient 
to show that HLP implies (6). Let Tf =f, be the sequence of operators 
from X(C) into itself, defined by placing f,(a) = f(x) or f»(x) =0 according 
as 1/fn or OS By [3], Theorem 3.8.3, are bounded 
linear operators. For x > 1/n we have —0(2;—-f) Sf,n(x) S0(2z;f), hence 
there is a function F in X(C), e.g, F=|0(f)| + |6(—f)|, so that 
| fx(x)| S F(x) for all x and n. Also, f,(z) as n—>0o. This implies 
that f,—>f in the norm of the space X(C’). Since the limit of a convergent 
sequence of linear bounded operators is bounded, we see that the operator 
Tf—f has a bound. This establishes (6) and completes the proof. 
For fe X(C), f=0 we have, with the notation (3), 


1 ax 
ll f = sup c(x)dx/x f(t)dt 
ceC J 0 
(7) 


= sup f(x) dx fe(t)/tat—sup f(x) (a) dz. 
Jz ceC 0 


ceC 
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The necessary and sufficient condition for (C)eHLP is, by the Lemma, 
that || = ||, feX(C). Remembering the definition of the dual space, 
we see that Y(C) e HLP is equivalent to | ¢l|_y. SA. In this argument we 


could have restricted fe X(C) to be positive and decreasing. Let C* denote 
the subclass of C consisting of all positive decreasing c*eC. Then instead 


of (7) we get, since f(x) is positive and decreasing, 


|| f sup = sup 
ceC*/0 


c*eC*J0 


Hence we obtain: 


TuroremM 1. If C is rearrangement-invariant, then X(C)e HLP tf 
and only if for some constant A, 


(8) 
for all ce C, or, equivalently, for all ce C*. 
We note also another condition for X¥(C)ceHLP. For 0Sa<1, let 


fa denote the function =f(azv). Since 


f | sup c(a)de/x | = sup fi f(az)| da 


ceCe/’0 


we obtain by Lemma, that for X(C)eHLP it is sufficient that for some 
constant A and all fe X(C), 


(9) Vfelxda<A If lx. 


38. Spaces &(¢) and MEI(¢). Let be a decreasing positive inte- 
grable function on (0,1), which we shall assume zero for 2>1, and let 


@(z) =  “o(zyae. The space A(¢) consists of all functions f(z) such that 
ty 1 1 
(10) fla= (x) f*(x)dx = sup f(x)| dx << +00; 
70 r 0 


the supremum is taken over all rearrangements ¢, of ¢. Dual with A(¢) 
is the space M(¢#) of functions f with 


(11) | Fle —sup (1/6(a)) f 


Obviously A(¢é) and M(¢) are spaces Y(C) with rearrangement-invariant C. 


with M = (2—e) /e. 
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For A(¢), C* consists of the single function ¢. Applying Theorem 1 and 
formula (11) to this C*, we see that A(¢) e HLP exactly if 


0<aX1. 


Changing the order of integrations we see that this is equivalent to the 
existence of a constant M such that 


1 7a 
(12) Me" { (x) de, 


This condition will be simplified. 


THEOREM 2. A space A(¢) has the Hardy-Littlewood property if and 
only if 
(13) lim sup ®(2a)/®(a) <2. 


Proof. If (138) holds, there is an d and an e>0O such that 
@(2a)/@(a) << 2—e for 0<aSa. This quotient is a continuous func- 
tion of a for a=ado, which can assume the value 2 only if ¢(2) is constant, 
which is ruled out by (13). Hence we can assume that the inequality holds 
for all 0 <a=1. Then we shall have 


(14) { $(2)de, 0<aX<l, 


These two inequalities give 
1 7a 2a 
e/a 70 0 Je 


+f 
2k-1g 


<(14+(1—) +—9(1—«/2) - 


= (M/a) f od, 


This proves that (13) is sufficient. 
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Now assume that (13) does not hold. We shall show that (12) is 
false for an arbitrarily large M. 
For M chosen, take a natural number n > 2M and an e > 0 so small that 


(16) 4(1— 2%e)n > M. 
By hypothesis, there is a 6 with 0 <b <1 such that 


(17) (2—e) J eae, (1—«) 


Then 
b/2 
(18) = (1— 2e) dz, 
b/2 0 


for otherwise we would have 


2b b b 
< +(1—2) f ode, 
Zt 0/2 0 0 


b/2 


which contradicts (17). Repeating this argument several times, we obtain 


for k=1,2,---,n, 
(19) f = (1— 2%) gdx=(1—8)  ddz, 
J 0 70 


where 6=2"e. Furthermore, 
ab/2 


J 0 0 


0 
After these preparations we obtain for our special choice of b, by (18) 
and (19), 


a1 /2"-1 ab 
b(x)/x dx = 2"-1/b dda 2"-?/b 1/b f 
b/2" e b/2" h/2"-1 b/2 
b/2"-1 b/2 
> (1 — 8) /b{2" 4 gn-2 } 
0 0 
0 0 0 
/2n 
= 3(1—8)"n 2"/b 
Because of (16), we see that for the chosen JM, (12) is violated for 


a=b/2" <1. Thus (18) is necessary. 


THEOREM 3. A space M(q) has the Hardy-Littlewood property if and 


only if. for some constant A, 


(20) for OSaZ 
70 


= 


G. G. LORENTZ. 


A(q@), the condition (8) becomes 


1 1 
2 
and this is equivalent to (20). 


4. Orlicz spaces. Let 0=¢(u)=-+ © be a function defined for u= 0, 
increasing to infinity with wu and vanishing for u==0, but not identically 0 
or +o for allw>0. Let w(u) be its inverse, defined in a familiar way 


everywhere, except perhaps on a countable set of points corresponding to 
the discontinuities of ¢. Let @(z) and ¥(x) be integrals of ¢ and y over 


(0,7); ® and © are called conjugate functions (in sense of W. H. Young). 
The Orlicz space Leg consists of all functions f(x) on (0,1) such that 


a1 


f fg dz exists for all g with 
0 


and we define 
Ifle— sup f fgdr. 
Iy(g)=1 0 
This space L» is a Banach space; it contains all functions f with [s(f)<+ 0, 
but in general also other functions. For example, if ¢(w) =0 for OSu1 
and ¢(u) =+o for u>1, Le is the space M of bounded functions, but 
Ip(f) <+-co means that | f(x)|1 a.e. However, Lg may be described 
as the set of all functions f such that for some A> 0, Ig(Af) <<-+ 0 ; indeed, 
it may be shown [4] that Je(f/||f|e®) 1. It follows that the spaces Le, 
Ly are dual to each other. Under the additional hypothesis that ®(2u)/(u) 
is bounded, a stronger result is true: Ly is the conjugate space to L». For 
the information of the reader we note that the boundedness of 6(2u)/®(u) 
is also necessary and sufficient for the space Lg to be separable. While the 
sufficiency of the condition is well known, its necessity seems to have 
escaped the attention of the writers on this subject until recently, when it 
was discovered independently by Mr. W. Luxemburg (Delft), Sobolev and 
Krasnosel’skif, and the author. 


THEOREM 4. A space Le has the Hardy-Littlewood property if and 
only if the conjugate function W satisfies 


(21) (2b) < M¥(b), b= bi, 


for some constants M and b, > 0. 


Proof. The functions c(z) of the class C* are here c(z) —®(a)" or 
c(z) =0 according as O=[¢%Sa ora<zxSl. Since the dual to M(¢) is 
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Proof. It is easy to see that (21) implies the existence of two positive 
constants «, C such that 


(22) U(ab) Ca*w(b), b=b,a>1. 


To prove that the condition is sufficient we shall show that it implies 
(9). For fa(x) =f(ax), fe Le we have with c(x) 20, 


sup f. "| f(az)| de. 


We shall begin by replacing arbitrary positive functions c(x) by others such 
that, for each x, either c(x) = by of c(x) —0. To achieve this, assume that 
c(z) 20 and Jy(c) =1, and decompose the interval (0,1) into two sets 
E,, EH, defined by the inequalities c(z) < bo, and c(z) 2 by. Since 


(the last inequality follows from the fact that a small constant c(z) =A > 0 
satisfies Iy(c) =1), we have 


Iy(c’)S1e 0 


where the c’(a) are such that either c’(x) = bo, or c’(x) =0. On substi- 
tution az = t we obtain that the last supremum is 


sup 1/a 

(23) 
< sup f | f(t) | dt. 


Ly(e’)Sa 0 
Assume first that Ca=1. Because of (22), Sa implies Iy((Ca)-/4c’) 
<1. Applying this to (23) and replacing (Ca)-1/*c’(t) by c(t), we obtain 
in this case 


sup 1/a(Ca))/4 | f |edt 
Iy(c)S1 Jo 


= + A)|| lle. 


For Ca=1, (23) readily gives || fal] SC. || f and we obtain the con- 
dition (9). 

We shall prove the necessity of (21) by means of Theorem 1. Assume 
that (21) does not hold. Taking into account the properties of Orlicz spaces, 
it will be sufficient to show that there is a function c(xr) 20 which belongs 
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to Ly and such that Jy(Ac) for each A> 0; this will imply that 
céLy. If (21) does not hold and (uw) is finite for all OS u<+, there 
must exist a sequence b, > such that ¥(2b;)/¥(b;,) >. Let the integers 
n, be defined by 


(24) < (b,) <2, busi, - 


We may assume (passing, if necessary, to a subsequence of the 6;) that 
> +k and that ¥(2b;,)/¥(b,) = so that 


Now we define c(z) =}, for 2°™*=7<2-™, k=—1,2,:--, and c(z) 
elsewhere. Then, by (24), 


and therefore ce Ly. On the other hand, for each A>0O we have by 
definition of ¢, 


2-0, -k+1 2- 


2-n,-k 


f 
0 k=1 
= (AD, (k — 1) log 2). 
k=1 
For all sufficiently large values of k, say for k= k)—ko(A), the argument 
of © in the last sum is greater than 2b,, and by (25), the series diverges. 
Hence f W(Ac)dz = +0. 
0 
In the case when (21) is violated in such a way that W(w) becomes 


infinite for finite u, the proof is similar, but somewhat simpler. 
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ON ALGEBRAIC GROUPS AND HOMOGENEOUS SPACES * 


By AnprE WEIL. 


In a recent paper in the same JouRNAL ([4] of the bibliography ; quoted 
hereafter as AG), I gave some results on algebraic groups and transformation- 
spaces, which supplement those in my Variétés abéliennes ([3]; quoted as 
VA). Applications will now be made of that theory to somewhat more specific 
questions. In no. 1, a rather general procedure is described for obtaining, 
from a given transformation-space S with respect to a group G and from a 
suitable cycle on S, another transformation-space with respect to the same 
group. As shown in no. 2, this includes as a special case the construction 
of coset-spaces and of factor-groups; thanks to the main theorem in AG, 
these can now be defined without enlarging the groundfield, whereas such an 
enlargement was required in their construction as previously given by S. 
Nakano ([2]); except for this, we have substantially followed his method. 

The rest of this paper is chiefly devoted to “principal homogeneous 


spaces,” i.e. to those homogeneous spaces on which the group operates in a 
simply transitive manner. The pair consisting of such a space and of one 
point on it does not differ materially from a group; thus there is little incentive 
for studying those spaces as long as one is not paying any attention to the 
groundfield or if the groundfield is algebraically closed. But it can happen 
that a principal homogeneous space contains no rational point over the 
groundfield over which it is defined; an example of this is given by the plane 
curve X*-+ pY*==p* over the rational number-field, where p is a rational 
prime ; this may be considered as a principal homogeneous space with respect 
to its jacobian variety, which is the plane curve Y?=4X¥%—2%. More 
generally, Chow’s work (cf. [1]) has shown that it is not always possible to 
map a curve “canonically” into its jacobian variety by a mapping defined 
over the groundfield, but that the curve can always be so mapped into a 
suitable principal homogeneous space with respect to its jacobian variety. 
This, among other results, will be proved again here by a different method, 
which can be extended at once to a variety V of higher dimension and to its 
Albanese variety, provided the groundfield is one over which the latter is 
defined. It will also be shown that the elasses of principal homogeneous 
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spaces with respect to a commutative group can be arranged into a torsion- 
group, i.e. a group whose elements are all of finite order; and it follows at 
once from the results of no. 5 that this group must be countable if the 
groundfield is finitely generated over the prime field. There seems to be no 
reason why it should be finite, even if the groundfield is the field of rational 
numbers; a more detailed investigation of its structure, e.g. for the case of 
an elliptic curve over the field of rational numbers, would be of considerable 
interest from the point of view of the theory of diophantine equations. 


1. Let G@ be a group and S a transformation-space with respect to G, 
both defined over a field &. Let Z be either a divisor on S or a cycle on 8 
whose components have coefficients which are prime to the characteristic of 
the universal domain. We denote by sZ, for any seG, the transform of Z 
by the mapping w—su of S onto itself. Let K be an overfield of k over 
which Z is rational. Let x be generic over K on G; by Prop. 6 of the 
Appendix of AG, there is a finitely generated extension k(t) of k which is 
the smallest overfield of & over which 2Z is rational; as zZ is rational over 
K(x), we have k(t) C K(x). If 2’ is also a generic point of G over K, 
and o is the isomorphism of K(x) onto K(x’) over K which maps z onto 2’, 
o will transform xZ into 2’Z; if ¢’ is the image of ¢ under o, k(t’) will then 
be the smallest overfield of & over which 2’Z is rational, and, by Prop. 6 
of the Appendix of AG, the cycle 2’Z depends only upon ?’; in other words, 
if o, is an isomorphism of K(x) onto a field K(2,’), mapping x onto z,’ and 
t onto ¢,’, we have 2’Z—2,’Z if and only if ¢’=+#,’. In particular, take 
x = yz, with y generic over K(x) on G; then k(t’) is the smallest overfield 
of k over which yzZ is rational; as yzZ can be written as y(zZ), it is 
rational over k(y,t), so that k(t’) C k(y,t) ; similarly, 2Z can be written as 
y'(yxZ) and is therefore rational over k(y, t’), so that k(t) C k(y,t’). This 
shows that k(y,t) =k(y,?’). 

Put now z~vyz, so that we have k(t’) C K(z); take y’ generic over 
K (x,y) on G, and call + the isomorphism of K(z) onto K(y’z) over K which 
maps z onto y’z=y’yz; let t” be the image of ¢’ under r. Then #” is the 
image of ¢ under the isomorphism roo of K(x) onto K(y’yx) over K which 
maps z onto y’yz. 

If Z is such that k(t) is a regular extension of k, then we may call 7 
the locus of ¢ over k and, with the above notations, we may write t’ = g(y,t), 
where g is a mapping of GX T into T, defined over k. Then the results 
we have just proved mean that g satisfies (TG1,2) of AG, no. 2, i.e. that 
it is a normal law between G and T. Applying now the main theorem of 


AG, we get the following result: 
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Proposition 1. Let G be a group and S a transformation-space with 
respect to G, both defined over a field k. Let Z be either a divisor on S or a 
cycle on S whose components have coefficients which are prime to the charac- 
teristic. Let K be an overfield of k over which Z 1s rational; and assume 
that, if x is generic over K on G, the smallest extension k’ of k over which 
rZ is rational is a regular extension of k. Then there 1s a transformation- 
space T with respect to G, defined over k, and an everywhere defined mapping 
F of G into T, defined over K, such that the point t= F(x) ts generic over k 
on T, that k’ =k(t), and that F(ss’) =sF(s’) for all s, s’ on G. For any 
s, in G, we have F(s) =F (s’) if and only if sZ=s'Z. If Z is algebraic 
over k, one can take for T a homogeneous space with respect to G. 


The existence of a transformation-space 7’ with a generic point ¢ over & 
such that k’ k(t) has been proved above; moreover, with the same notations 
as above, we have k(t) C K(az), C K(yz), U=yt; as K(x), K(yr) 
are independent extensions of K, this shows, if K is algebraic over k, that 
k(t), k(t’) are then independent extensions of &, i.e. that T is pre-homo- 
geneous, so that, by the main theorem of AG, we may replace it by a 
birationally equivalent homogeneous space. As k(t) C K(a), we may write 
t= F(z), with F defined over K ; then we have yt = F(yz), i.e. F(yx) = yF (a), 
for y generic over K(x) on G. This may be written as F(z) =y"F (yz), 
which shows, if s is any point of G and y is taken generic over K(s) on G, 
that F is defined at s. As F is everywhere defined, the relation F(yr) = yF (a) 
implies F'(ss’) =sF(s’) for all s, s’ on G. Now, for any s, s’ on G, take x 


’ are both generic over K on G, and 


generic over K(s,s’) on G; then zs, zs 
therefore, if o is the isomorphism of K (zs) onto K(as’) over K which maps 
zs onto as’, o will map F(2s) onto F'(as’) and the cycle onto xs’Z ; then, 
as we have seen above, we have xrsZ —<«as’Z if and only if F(rs) = F(zs’) ; 
as the latter relation can be written xF'(s) —2zF(s’), so that these two 
relations are respectively equivalent to sZ—s’Z and to F(s) =F (s’), this 


completes our proof. 


2. We first apply Prop. 1 to the construction of the homogeneous space 
defined by a group @ and a subgroup of G. 


Proposition 2. Let G be a group, defined over a field k; let Z be a 
rational cycle over k on G, consisting of components with coefficient 1, and 
such that its support |Z| is a subgroup of G. Then there is a homogeneous 

| space H with respect to G, defined over k, and a rational point a over k on H, 
with the following properties: (i) if we put, for a generic x over k on G, 
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F(x) =a, the mapping F of G onto H determines a one-to-one mapping of 
the cosets of in G onto the points of H; (ii) k(x) ts separable over 
k(F(x)); (iii) if $ is a mapping of G into a variety V, defined over an 
overfield K of k, and such that $(xs) =¢(x) whenever se|Z| and x is 
generic over K(s) on G, there is a mapping wy of H into V, defined over K, 
such that 6=yoF. If |Z| is a normal subgroup of G, then one can define 
on H a group-law, defined over k, such that F is a homomorphism of G onto H. 


Z 


The “support” of a cycle was defined at the beginning of the Appendix 
of AG. The assumption on |Z{ implies that Z has one and only one com- 
ponent Z, containing e, that this is a subgroup of G, and that all the com- 
ponents of Z are cosets of Z, in G. We apply Prop. 1 to the cycle Z on 
S = G, G acting on itself by the left-translations, and to the field /; if a is 
generic over k on G, the smallest extension of & over which xZ is rational 
is contained in k(x), and hence, by AG-App., Prop. 3, Coroll., it is regular 
over k. Therefore, by Prop. 1, there is a homogeneous space with respect to 
G@, which we now call H, defined over k, and a mapping F of G into H, 
defined over k, with the properties stated in Prop. 1; in particular, F is 
everywhere defined, t== F(z) is generic over k on H, and k(t) is the smallest 
extension of & over which 2Z is rational. If we put a—F’(e), a is rational 
over k, and we have, for all seG, F(s) =sa. If s, s’ are any two points 
on G, we have sa=s‘a if and only if sZ =s’Z, i.e. if and only if s*s’Z =7; 
by the assumption on Z, the latter relation is equivalent to ss’e|Z|. Thus 
the points of H are in a one-to-one correspondence with the cosets of | Z 
in G. 

Call T the graph of F on GX H. For any be H, TN (GX |B) is the 
set of those points (s,b) which are such that sab; in particular, if x is 
generic over k on G and if we put {=F(z) =aza, [TN (GX #) is the set 
of the points (s,¢) such that sa= <a, i.e. atse| Z|; this set can be written 
as |#Z| xt. As T-(G@ Xt) is the prime rational cycle over k(t) on G XH 
with the generic point (x,t) over k(t), this shows that the prime rational 
cycle Z’ over k(t) on G with the generic point x has the same components 
as the cycle 7Z; as the latter is rational over k(t) and its components have 
the coefficient 1, this implies that Z’—2Z. As the components of the prime 
rational cycle with the generic point x over k(t) have the coefficient 1, /(.) 


must be separable (i.e. “separably generated”) over k(#). 


As to (iii), let x be generic over A on G; put {=F (x) and w—¢(): 
let w’ be any generic specialization of w over K(t); this can be extended 


to a generic specialization 2’ of 2 over K(t); we have then w’ = ¢(.’) 
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and {== F(z’), and the latter relation implies that 2’ is on | #Z |, i.e. that it 
is of the form zs with se|Z |. Let & be generic on G over K(s); we have 
= (2) ; specializing Z to x over K(s), we get (2s) = since 
both sides are defined, i.e. w’—w. This shows that w is purely inseparable 
over K(t); as it is at the same time rational over K(x) which is separable 
over K(t), it is therefore rational over K(t), and we may write w=—y(t), 
where y is a mapping of H into V, defined over K. This proves (iii). 

Finally, assume that | Z| is a normal subgroup of G; let z, y be inde- 
pendent generic points of G over k; put t=F (x), u=F(y). We have 
F(xy) =«F(y) this is a function of a, defined over k(u). If se|Z|, 
we have «sy—ays’ with s’=y'sye|Z|, and therefore F(xsy) =F (zy) ; 
by (iii) applied to the mapping xu of G into H and to K —k(w), this 
implies that F'(xy) is rational over k(u,t). Therefore the mapping u—> zu 
of H into H is defined over &(t); on the other hand, if k’ is any field of 
definition for that mapping, containing &, the image za—t of a by it is 
rational over k’, so that k(t) C k’; thus k(t) is the smallest field of definition 
for the mapping u—>zu. This shows that G is not operating faithfully on H; 
applying Prop. 2 of AG, no. 3, to G and H, we see that we can define on H 
a normal law of composition f such that F(zry) —f(F(2x),F(y)). By the 
main theorem of AG, we can then replace H by a birationally equivalent 
group H’, defined over k, with a mapping F” of G into H’, also defined over 
k, such that F’ (ry) = F’(x)F’(y), F(x) and F’(zx) being corresponding 
generic points of H and H’ over k when z is generic over k on G. As usual, 
from the relation F’(x) = F’(y)-1F’(yx) which holds for z, y generic and 
independent over k, we deduce that F” is everywhere defined‘; therefore, if 
@ is made to operate on H’ by the law (2,w) > F’(x)w for x, w generic and 
independent over k on G and H’, H’ is a homogeneous space with respect 
to G. But then the unicity assertion in the main theorem of AG can be 
applied to H and H’ and shows that they are biregularly equivalent; in other 
words, H itself, with the law f, is a group. This completes the proof. 

It is easily seen that the pair (H,a) is uniquely determined, up to an 
isomorphism, by the conditions (i), (ii), (iii) in Prop. 2; in other words, if H’ 
and a’ have similar properties, there is an everywhere biregular birational cor- 
respondence between H and H’ which maps a onto a’ and transforms the law 
of composition between G and H into the law between G and H’. The space 
H may be called the coset-space determined by G and Z, and may be denoted 
by G/Z; if |Z| is a normal subgroup of G, the space H, with the group-law 


* This is Theorem 1 of Nakano ([2]). 
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determined by Prop. 2, is called the quotient-group (or factor-group) of @ 
by Z, and is denoted by G/Z. 


3. Before making another application of Prop. 1, we will introduce 
a new condition which a law of composition may satisfy. Let V, W be two 
varieties, g a mapping of V & W into W, and & a field of definition for V, 
W and gq; consider the following condition: 


(TG1’) If x, u are independent generic points of V, W over k, and 
v=g(zr,u), then k(z,u) =k(az,v) =k(u,v). 


The condition k(z,u) —k(z,v) is equivalent to (TG1) of AG, no. 2. 
The condition k(z,u) —=k(u,v) implies that the dimension of V, which is 
the dimension of x over k(u), is the same as that of v over k(u), and there- 
fore at most that of W; if the dimensions of V and of W are the same, this 
implies that v is generic over k(u) on W, which is condition (H) of AG, 
no. 2. Let k’ be any field of definition containing k& for the birational 
correspondence w>v=—g(z,u) between W and itself; if uw is taken generic 
over k’(x) on W, we have k’(w) =k’ (v) ; since k(x) C k’(u,v) by (TG1’), 
we have k(x) Ck’(u). Taking wu’ generic on W over k’(z,u), we get in 
the same manner k(z) Ck’(u’). As k’(u), k’(u’) are independent regular 
extensions of k’, their intersection is k’, so that k(x) C k’. This shows that 
(TG 1’) implies (TG3). In view of the results of AG, end of no. 3, this 
shows that, if g satisfies (TG1’) and (TG2’), or if two mappings f, g of 
VX V into V and of V X W into W are given and satisfy (TG1’.2), then 
V is a pre-group and W a pre-transformation space, and V operates faithfully 
on W. 


If a pre-group V and a pre-transformation space W satisfy (TG1’), f 


we say that W is a pre-principal space with respect to V; if at the same time 
V and W have the same dimension, so that, as we have shown, W is pre- 
homogeneous with respect to V, we also say that V is simply pre-transitiv 
on W. 

Let W be a pre-principal space with respect to a pre-group V; by the 
main theorem of AG, we can construct a group G and a transformation-space 
S, birationally equivalent to V, W and defined over the same field &; then 
S is also pre-principal with respect to G. Let T be the locus of (wu, <u) 
over k on S XS, xz and wu being independent generic points of G, S over k; 
put (u,zu); then (TG1’) implies that C k(t), i.e. that we may 
write z= $(t), where ¢ is a mapping of T into G, defined over k; conversely, 
if this is so for a transformation-space S with respect to G, 9 is pre-principal. 
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The space S will be called a principal space with respect to G if, for z, u 
generic and independent over k on G, 8 and for t= (u, rw), we have = $(¢) 
where ¢ is an everywhere defined mapping, defined over k, of the locus T 
of ¢ over & into the group G. If at the same time § is homogeneous, it will 
be called a principal homogeneous space with respect to G. 


Proposition 3. Let S be a pre-principal transformation-space with 
respect to a group G, both being defined over a field k. Then there ts a k-open 
subset P of S which is a principal transformation-space with respect to G; 
if G and S have the same dimension, P is uniquely determined and is 


homogeneous. 


Let 7 and ¢ be defined as above; call F the k-closed subset of TJ’ where 
o is not defined. We first show that, if (a,b) is in F, (sa,s’b) is in F for 
all s, s’ in G. In fact, take x, w generic and independent over k(s,s’) on 
G, 8; put v= zu, —su, v, = then we have and 
@,, U, are generic and independent over k(s,s’) on G, S, so that (u,v) and 
(u,.v,) are generic points of T over k(s, s’), and that z= ¢(u, v), 21 = f(t, V1) 
by the definition of ¢; this gives 

(u,v) s’v)s. 
If (a,b) is in T, it is a specialization of (u,v) over k(s,s’), and therefore 
(sa, s’b) is also in 7’; then the above relation shows that ¢ is defined at (a,b) 
if it is defined at (sa,s’b), i.e. that (a,b) ¢«F implies (sa,s’b) F. 

As (e,w) is a specialization of (z,u) over k, e being the neutral element 
of G, T contains the diagonal A of SS. As the projection of A on either 
factor of S < S is everywhere biregular, the projection of the k-closed subset 
F(A of A onto S is a k-closed subset #” of S, consisting of the points ae 
such that ¢ is not defined at (a,a). From what we have proved above, it 
follows that, if ae F’, sae F’ for all seG. For the same reason, if a is in 
S— FH’, then d(a,sa) is defined for all se @; as (a,sa,s) is then a specializa- 
tion of (u,ru,2) over k, and zu), this shows that ¢(a,sa) for 
all ae S—F” and all seG, and therefore k(a,s) —k(a,sa); in particular, 
if x is generic over k(a) on G, the locus of za over k(a) has a dimension equal 
to that of G. 

If @ is complete, every specialization (a,b) of (u,zu) over k can be 
extended to a specialization (a,b,s) of (u,zu,z) over k, so that b—sa; in 
other words, every point of 7 must be of the form (a,sa), with aeS and 
seG; then it follows from what we have proved above that such a point 
cannot be in F unless a and sa are in F’. Without attempting to decide 
whether this is still so in the general case, we shall merely show that, if u 
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is generic over k on S and (u,w’) is in F, then wu’ must be in F’. In fact, 
suppose that this is not so; take x generic over k(u,u’) on G; call X, X’ 
the loci of zu, zu’ over k(u,u’) on S; by what we have proved above, they 
have the same dimension, which is that of G. By F-VI;, Th. 11, we have 
TA (uxX8S)=u XX; at the same time, since we have shown that (u, ru’) 
is in T, ux X’ is contained in TM (uX 8S); as X and X’ have the same 
dimension, this implies that XY —X’. But, as we have shown, since (u,w’) 
is in F, (u,zu’) must be in F, and therefore, since F is k-closed, u < X’ 
must be contained in F; as X =X’, this implies that F contains (wu, zw), 
which is generic on J over k, and contradicts the definition of F. One may 
observe that, if S is pre-homogeneous, this again shows that (a,b) cannot be 
in F unless a, 6 are in F’; for, if a¢F’ and z is generic on G over k(a,)), 
za has then over k(a) a dimension equal to that of G, and therefore is generic 
on § over k(a) since in the present case the dimensions of S and G are equal; 
then if (a,b) is in F, so is (za,b), and so b must be in F’. 

Now replace first S by S—F’; as F” is mapped onto itself by all 
operations of G, S—F’ is again a transformation-space with respect to G, 
defined over /, and satisfies our other assumptions. Writing again S instead 
of S— F’, we see that it is enough to prove our result under the additional 
assumption that 7’ =. If G is complete or S is prehomogeneous, this 
already implies that § is principal. Otherwise we observe that, since 7’ is 
also the locus of (2-u,w) over & and since we have ¢(z*u,u) =a, T and F 
are mapped onto themselves by the mapping (u,v) > (v,u) of SX 8 onto 
itself. Call now F” the “ projection” of F on either factor of 8 & S (in the 
sense of F-IV, and F-VIIs;, i.e. the closure of the set-theoretic projection) ; 
this will be the same, whether we project F' onto the first or the second factor, 
and it is not S by what we have proved above, since F’ is empty; it is there- 
fore a k-closed subset of S. By what we have proved, F” is mapped onto 
itself by all operations of G. Then S—F” is the principal space whose 
existence was to be proved. 

Finally, assume that the space S from which we first started was pre- 
homogeneous; this means that T—S XS. Let a, b be any two points in 
S — F”; then, if z is generic on G over k(a,b), za, xb are generic on S§ over 
k(a,b), and so there is an isomorphism o of k(a,b,za) onto k(a,b,xb) over 
k(a,b), mapping za onto xb; then we have i.e. This 
shows that S— F” is homogeneous, and also that an open subset of S which 
is a transformation-space for G cannot contain a point of S—F”’ without 
containing S—F’. Therefore S— F” is the only open subset of § which is a 
principal space with respect to G. 
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If S is a principal homogeneous space, the mapping ¢ of 77—=SXS8 
into G which has been defined above will be called the canonical mapping 
of SXS into G. For any a, b on S and s on G, the relations ) =sa, 
s= (a,b) are equivalent; in particular, for any a on S and for z generic 
over k(a) on G, the mapping z->2za—v of G into S has the inverse 
v—->2x=—=4¢(a,v); as both are everywhere defined, this is therefore an every- 
where biregular mapping of G onto 8, defined over &(a). In particular, if 
there is at least one rational point a over & on S, S is biregularly equivalent 
to G over k. 


4. Let G be a group, V and W two varieties, and F a mapping of 
VX W into G, all defined over a field &. We may consider W XG as a 
transformation-space with respect to G, the law of composition between them 
being (x, (N,y))—> (N, zy) for any N in W and any z, yin G. We now 
apply Prop. 1 of no. 1 to the case when we take for S this transformation- 
space WX G and for Z the graph of the mapping N->F(M,N) of W into 
G, where M, N are independent generic points of V, W over &. We must 
then consider the smallest field of definition k’ containing & for the mapping 
N—zF(M,N) of W into G, where x is generic over k(M,N) on G. As 
this mapping is defined over k(z,M), k’ is a regular extension of k, con- 
tained in k(z,M). Then Prop. 1 shows that we may write k’ —k(u), where 
u is a generic point over & of a transformation-space U with respect to G; 
as k(u) Ck(x,M), we may write w=—f(z,M), where f is a mapping of 
GX V into U, defined over &; moreover, as the mapping tz—>f(z,M) of G 
into U is no other than the mapping F defined in Prop. 1, we see that f is 
defined at every point (s,M) of GX M, and that f(ss’, M) =sf(s’, M) ; taking 
s’=e, and writing f(M) instead of f(e,M), this gives f(s,M) —sf(M), 
and in particular u—<af(M). As the mapping N >2zF(M, WN) is defined over 
k(u), eF (M,N) is rational over k(u,N): similarly, if y is generic over 
k(z,M,N) on G, the mapping N— yrF(M,N) is defined over k(yu), and 
so yxF' (M,N) is rational over k(yu,N). As we can write 


y = 


this shows that y is rational over k(u,yu,N). If N’ is generic on W over 
k(a,y, M,N), y must then also be rational over k(u, yu, N’); thus k(y) is 
contained in k(u,yu,N) and in k(u,yu,N’); as these are independent 
regular extensions of k(u, yw), their intersection is k(u, yu), and so we have 
k(y) Ck(u,yu). This means that U is a pre-principal space and may 
therefore, by Prop. 3, be replaced by a principal space, birationally equi- 
valent to it. 


‘ 
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The most interesting case is that in which there are two mappings 
F,, F. of V, W into G, defined over some overfield K of k, such that 
F(M,N) =F,(M)F.(N) for M, N generic and independent over K on 
V, W; by the corollary of Th. 7, VA-18, this is always so whenever @ is 
an abelian variety. Take x generic over K(M,N) on G, and put z=zF,(M); 
then the mapping N->2F(M,N) =2zF.(N) is defined over K(z), so that 
k(u) C K(z). As x, M are generic and independent over K (N) on G, V, u 
is then generic over K(N) on U, and the dimension of U is that of u over K; 
the relation K(u) C K(z) shows that this is at most the dimension of G. 
Therefore U is pre-homogeneous and may be taken to be a principal homo- 
geneous space with respect to G. Moreover, we may write u—®(z), where 
® is a mapping of G into U, defined over K. If we substitute yz for z, with 
y generic over K(M,N,z) on G, z is replaced by yz, and u by yu; this gives 
&(yz) = yb(z), which may be written as @(z) =y'*®(yz) and thus shows 
that ® is everywhere defined. Putting now a=(e), we see that a is rational 
over K and that u—za, i.e. f(M) =F ,(M)a. Put now =F.(N) a; 
g is a mapping of W into U, defined over K. As we have also 
g(N) =F(M,N)“*f(M), g is also defined over the field &(M), and there- 
fore also over k(M’) if M’ is another generic point of V over K; if we take 
M, M’ generic and independent over K on V, k(M) and k(M’) are indepen- 
dent regular extensions of k&, so that their intersection is k; hence g is defined 
over k. Thus we have proved the following result: 


Proposition 4. Let G be a group, V and W two varieties and F a 
mapping of V & W into G, all defined over k. Assume that there are two 
mappings F,, F, of V, W into G, defined over some overfield K of k, such 
that F(M,N)=F,(M)F.(N) for M, N generic and independent over K 
on V, W. Then there is a principal homogeneous space U with respect to G, 
and two mappings f, g of V, W into U, all defined over k, such that 
f(M) =F(M,N)g(N), i.e. F(M,N) =¢(g(N),f(M)), where is the 
canonical mapping of U X U into G. 


CoroLLary. Notations being as in Prop. 4, U, f and g are uniquely 
determined by G, V, W and F, up to an isomorphism. 


In fact, assume that U’, f’, g’ have similar properties; then we have 
(M,N) = where ¢’ is the canonical mapping for U’. 
This shows that the mapping N—2F(M,N) is defined over k(u’) with 
wu’ = 2f'(M); thus, if we put w—-2f(M) as before, we have k(u) C k(w’) 
and may write u—=y(wu’), where y is a mapping of U’ into U, defined over k. 
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Replacing x by yz, with y generic on G over k(M, N,2x), we get yo=y(yu’) 5 
from this we conclude, in the usual manner, that y is everywhere defined. 
Take any point a’ on U’, and put a=y(a’); as we have za=y(aa’), and 


as the mappings r—>za, <a’ are everywhere biregular mappings of G 
onto U and U’, defined over k(a’), we see that y is an everywhere biregular 
mapping of U’ onto U. Moreover, we have y(u’) =2y(f’(M)), and there- 
fore f=wof’; from this one easily concludes that g=yog’. This proves 
our assertion. 


5. Let @ be a group, defined over a field &. We will now prove that 
the classes of principal homogeneous spaces with respect to G, for birational 
equivalence over k, form a set. In fact, let x, y be independent generic 
points of G over k; let « be the isomorphism of &(.) onto k(yx) over & which 
maps z« onto yz. Let H be any principal homogeneous space with respect 
to (, defined over k; let a be an algebraic point over & on H, and put u= za, 
so that wu is generic over k on H. Then k(u) is a regular extension of & 
contained in &(x) and such that =k(2) ; moreover, we have 


k(y,u) =k(y, u?) =k(u, u7) 


since u7 == yu. Conversely, let k(w) be any such extension of k, and call U 
the locus of u over k; then we may write u% =g(y,u), where g is a mapping 
of G & U into U, defined over i; and one verifies at once that this makes 
into a pre-principal pre-homogeneous space with respect to G, and thus deter- 
mines uniquely a class of birationally equivalent principal homogeneous spaces 
with respect to G. As every such class is determined by at least one such 
extension, this shows that these classes form a set. 

If @ is commutative, one can define canonically a commutative group- 
structure on the set of classes of principal homogeneous spaces with respect 
to G. In order to do this, we first observe that, if H is any transformation- 
space over a commutative group G, then the law (2,u)—>2z-1u, for re G, 
we HI, defines on H a structure of transformation-space with respect to G; 
this will be called the opposite transformation-space to H and will be denoted 
by H-; it is a principal homogeneous space if H is such. 


Proposition 5. Let G be a commutative group, defined over a field k. 
Let H;, for 1 Sisn, be principal homogeneous spaces with respect to G, defined 
over k. Then there is a principal homogeneous space H with respect to G, 
defined over k, and an everywhere defined mapping f of H, X H.X-::X A, 
into H, defined over k, such that 
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Moreover, H and f are uniquely determined up to 


for all seG and aye Hj. 
an tsomorphism of H. 


Put V=W=dH, X H2X%:- +X Hn; call ¢; the canonical mapping of 
H; X H, into G, so that b; = sa; is equivalent to s = ¢;(a;, b;) for a;, b; in H; 
and s in Let w= and be two points 
of V; put 


F(u,v) = fi (Ui, Vi), 


where the right-hand side has a meaning since G is commutative. On each 


H;, choose a point a;, and put a= (a,- - -,dn). We have 


hi (Ui, Vi) = Pi (Gi, Vi) Gi (Qs, Ui) 


for all i, as one verifies at once, and therefore, again because of the commu- 
tativity of G: 


F (u,v) = F(a,v)F(a,u)7. J 
Thus the assumptions of Prop. 4 are satisfied, so that there is a principal 
homogeneous space U and two mappings f, g of V into U, all defined over f, 
such that 0 
(1) f(u)=F(u,v)g(v), F(u,v) = $(9(v),f(u)), 
( 
where ¢ is the canonical mapping of U x U into G. Take any point b on V, 
and take v generic over k(b) on V; as F is defined at (b,v), the relation (1) tl 
shows that f is defined at b. Thus f is everywhere defined. As F(u,w) =e, f' 
the relation (1) gives g(u)=—f(u), i.e. If are any 
elements of G, and we put s=s,- - -s, and w’ = (8,U1,° *,SnUn), we have 
F(u’,v) (u,v) and therefore, by (1), f(w’) =stf(w). If we now a 
put H=U-, i.e. if we take for H the opposite space to U, H and f will 
have the properties stated in Prop. 5. 
Let us now assume that H and f have similar properties; put U = H-. of 
Put z=F(u,v)“f(u), the multiplication in the right-hand side being that th 
of U. If the s, s and wu’ have the same meaning as above, we have 
f(u’) =s*f(u), so that Z does not change if one replaces u,v by w’,v. de 
Therefore &(Z) is contained both in k(u,v) and in k(u’,v). If the s; have 
been taken generic and independent over k(u,v) on G, k(u,v) and k(u’.v) - 
will be independent regular extensions of &(v); this gives &(Z) C k(v), cla 
so that we may write Z=g9(v), with g defined over &. Then we have § 


f(u) =F (u,v)g(v) ; by the corollary of Prop. 4, 0, # and g must then be 


i 
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the same as U, f and f, respectively, except for an isomorphism of U onto U. 
This proves the assertion about unicity in Prop. 5. 

In Prop. 5, take n= 2; call the classes of Hi, Hz, and denote 
by #: + #, the class of H. This defines on the set of classes of principal 
homogeneous spaces with respect to G a commutative group-structure. In 
fact, commutativity is obvious. Call 9, the class of G, and therefore of all 
principal homogeneous spaces with respect to G which have a rational point 
over &. For any principal homogeneous space H with respect to G, the 
mapping f(z,u) au of GX H into H satisfies the condition of Prop. 5; 
therefore we have = & for all classes If is the canonical 
mapping of H x H into G, then ¢, considered as a mapping of H X H- into 
G, satisfies the condition of Prop. 5; therefore, if #f- is the class of H-, 
we have H+ H,. Finally, let H:, H., Hz; be three principal homo- 
geneous spaces with respect to G; apply Prop. 5 successively to the following 
spaces: (a) to H,, H>, obtaining a space H,, and a mapping f2; (b) to Hie, 
H,, obtaining H’, f’; (c) to H2, Hs, obtaining Hos, fos; (d) to Hi, Aes, 
obtaining H”, f’; (e) to H,, H., Hs, obtaining H, f. Then the two mappings 


(fre (ti; Ue), Us); (ta, fos (Ue, Us) ) 


of H, X H. x H, into H’, H” satisfy the same condition as the mapping f. 
By the unicity assertion of Prop. 5, this shows that H’, H” are isomorphic 
to H. This means that the addition 9, + 9, is associative. 

One proves quite similarly, by induction on n, that if 9% and H, are 
the classes of the spaces H, H; in Prop. 5, then # = = Hi. In fact, let H’, 


f’ be the space and the mapping obtained by applying Prop. 5 to H,,: - -, Hn, 
so that HM’ +---+ H,, by the induction assumption; and let H”, 
f’ be the space and the mapping obtained by applying Prop. 5 to H’, Hy, 
so that #” — H’ + H, by definition. Then the mapping 


of H, <---> H, into H” has the properties stated for f in Prop. 5, so 
that, by the unicity assertion in Prop. 5, H” is isomorphic to H. 

From this one deduces that every element # of the group we have just 
described is of finite order. In fact, take on a space H of class # any 


positive cycle S\a; of dimension 0, rational over &. Call H, any space of 

class n& ; then there is a mapping f(w,- - -,un) of the product of n factor: 

equal to H into H,, with the properties stated in Prop. 5. From the unicity 

assertion in Prop. 5, it follows that any permutation of the wu, will change f 
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into sf, with seG; as f is everywhere defined, we see that se by taking 
therefore f is a symmetric function, so that f(ai,- -,a,) 
is rational by the main theorem on symmetric functions (VA-7, Th. 1). So 
H,, has a rational point over &, and is therefore isomorphic to G. 

Now, & being as before, put Hp —G and take, for each integer n A(0, 
a space H, of class n& so that all the H, are disjoint. On the set 6 = U H, 


(which is of course not an algebraic variety), we will define a commutative 
group-law f (in the sense of group-theory, not of algebraic geometry) such 
that H, = G will be a subgroup of & and that f induces on Hy, X Hy, for all 
m, n, a mapping fmn of Hm xX H, into Hm» satisfying the conditions in 
Prop. 5. As there is such a mapping fm,» for each m, n, and as it is uniquely 
determined up to an automorphism of Hm,» (i.e. up to left-multiplication by 
a rational point of G), we merely have to choose the fm,» so that the mapping 
f of G X G into G which coincides with fm» on Hm X H, for all m, n satisfies 
the axioms for groups; we do this as follows. For any n, we take fo.n(x, wu) = ru 
for ceG, we H,. We choose f_,, and, for all n>0, fn. and so as to 


satisfy the conditions in Prop. 5. Now, for elements u,- - -,Un.: of H, in é 
any number, we define u,- - Uns; inductively as being equal to u, for n=0 | 
and to fini(Ui° for n=1; similarly, for elements Uni f 
of H_,, we define v,- - as equal to v, for n= 0 and to fin,-1(U1° * Uny Uns1) f 
for n=1. It is then easily seen that, whenever m, n are both > 0, there is I 
one and only one way of choosing fm,, so that it satisfies the condition ; 
when the u; are in H,; we determine f_m,, similarly, using H_, instead of H,. S 
Finally, for m= n > 0, we choose fim,n and f_m,n So as to satisfy the conditions 
respectively, the u; being any elements of H, and the 4 any elements of A... : 
It is then a trivial matter to verify that these choices of the fm, satisfy all p 
the requirements for a commutative group-law on ©. 1 
The points on the H, which are rational over k form a subgroup g of &. F 
As we have shown that there are such points for some n+0, there is 4 a 
smallest n >0 for which there is such a point aeH,; this n is the order 5 
of & in the group of classes of principal homogeneous spaces with respect t} 


to G. Then g is the direct product of the group go>=gM@ of rational 
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points over & on G and of the infinite cyclic group y generated by a. 
quotient-group @/y may be described as an algebraic group consisting of n 
components respectively isomorphic to Hyp =G, Hau. 


6. Proposition 6. Let A be an abelian variety and H a principal 
homogeneous space with respect to A, both being defined over a field k. Let 
V,.: °°, Vn be varieties, and F a mapping of Vi X- + -X Vn into H, all these 
being defined over k. Then there is for each 1 a principal homogeneous 
space H; with respect to A and a mapping F; of V; into H;, H; and F; being 
defined over k, and there is a mapping f of H, X-- +X Hn into H with the 
properties stated in Prop. 5, such that, for (M,,- > -,Mn) generic over k on 
ViX:°*:X Va, we have 


Moreover, all these are uniquely determined up to tsomorphisms. 


For n—1, there is nothing to prove. If the assertion is proved for 
a product of two factors, then this can be applied to the product 
ViX (V2X%:- +X Vn) of Vi and V2X-- +X Va, so that the general case 
follows by induction on n. Thus it is enough to treat the case of two 
factors V, W and of a mapping F of V X W into H. Call ¢ the canonical 
mapping of H XH into A; let (M,N) and (M’,N’) be two independent 
generic points of V & W over k; and put 


y=¢(F(M,N),F(M,N’)). 


so that we have 
ry = (M,N), F(M’,N’)). 


As the mapping ((M/, M’), N’) ~z of (V X V) X W into A has the constant 
value e on the variety (1/,M) X W, Th. 7 of VA-18 shows that z is rational 
over k(M,M’); for a similar reason, y must be rational over k(N, WN’); in 
other words, there are mappings ®, ¥ of V K V, W X W into A, both defined 
over k, such that z= M’) and y=W(N,N’). By the corollary of Th. % 
of VA-18, ® and © satisfy the assumptions of Prop. 4, so that there are two 
principal. homogeneous spaces U,, Uz with respect to A, mappings F,, G, of 
V into U, and mappings F., G, of W into U2, all defined over &, such that 
F,(M) =2G,(M’), F.(N) =yG.(N’) ; moreover, as 6(M,M), ¥(N,N) are 
defined and equal to e, we have G,—F,, G.—F,. Now call H;, Hz the 
spaces respectively opposite to U,, U2, and apply Prop. 5 to H,, H;: let H be 
the principal homogeneous space and f the mapping of H, X H. into A with 


= 
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the properties stated in that proposition. Put F(M, N) =f(Fi(M), F.N)). 
As H,, H. are opposite to U;, U2, we have 


F,(M’)=2F,(M), F2(N’) =yF2(N), 


multiplication in the right-hand sides being understood in the sense of H,, H,. 
By the definition of f, we have then: 


N’) — (ay) (M,N), 


while, as we have seen above, the same relation holds if F is substituted for F. 
But then, as the corollary of Th. 7, VA-18, shows that the mapping 


((M’,N’), (M,N)) > 


of (V X W) X (V X W) into A satifies the condition of Prop. 4, the corollary 
of Prop. 4 shows that H, F must be the same as H, F except for an iso- 
morphism of H onto H. Then, replacing f by a mapping f of H, X H, into 
H by means of that isomorphism, we have the spaces H,, H, and the mappings 
F,, F., f whose existence was asserted in our proposition. 

As to unicity, assume that there are spaces H,*, H.* and mappings F’,*, 
F,*, f* with the same properties. Then, x being defined as above, or equi- 
valently by F(M’, N’) =aF(M, N’), we have 


f*(F,*(M’), F2*(N’)) = xf*(F,*(M), F2*(N’)) = F.*(N’)) 


and therefore F,*(M’) since the mapping u->f*(u,v) of H,* 
into H is, as easily seen, an everywhere biregular mapping of H,* onto H. 
But then the corollary of Prop. 4, applied to the mapping (M’,M)—-z2 of 
V X V into A, shows that H,*, F,* are the same as H,, F, except for an 
isomorphism. The same argument applied to y instead of zx shows that 
H,, F, are uniquely determined up to an isomorphism. Then Prop. 5 shows 
that f is uniquely determined. This completes the proof. 


7. The foregoing results will now be applied to the theory of Jacobian 
varieties. As in VA-35, we consider a complete non-singular curve I of 
genus g > 0, defined over a field k. Ifa is any divisor on T, Prop. 6 of the 
Appendix of AG shows that there is a smallest field containing k over which 
a is rational; this field will be denoted by &(a). In particular, if M,.---.M, 
are independent generic points of T over & and if we put m= > M,, then, 


by VA-4, Lemma 1, &(m) is the field &(M,,- - -,M,), of symmetric func- 
tions of M,,---,M, defined over &, i.e. the subfield of k(M,,- - -,M,) 
consisting of those elements which are invariant under all permutations of 
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M,,- - -,M,; such a divisor m will be called generic over k. As k(m) is a 
regular extension of k, we may write it as k(w), where wu is a generic point of 
a variety W over k, and we may write u=F'(M,,: - -,M,), with F defined 
over k; as F is symmetric in the M;, this may also be written as u—F'(m). 
Now let the Ni, P;, for 11g, be 2g independent generic points of T 
over k(m) ; and put: 


this being a generic point over & of the product V=ITX---XT of 2g 
factors equal to T. By VA-35, Lemma 11, there is a positive divisor m’ 


g g 
on I linearly equivalent to m+ >} Ni—> Pi, and it is uniquely determined 


4=1 4=1 
and such that k(z,m) =k(a,m’); this implies that it is generic over k(z). 
Then, if we write w’—=F(im’), we have k(z,u) =k(a,u’); we may thus 
write u’ = g(z,u), where g is a mapping of V X W into W which satisfies 
1). 

We now show that this mapping satisfies the condition (TG 2’) of AG, 
no. 3. Prop. 2, so that this proposition may be applied to it. In fact, let y be 
a generic point of V over k(z,u) ; we may write 

Then the point u” =g(y,wu’) will be determined by wu” =F (m”’), where m” 
is the positive divisor linearly equivalent to m’+>3Q:—> RR; Applying 
i 
again VA-35, Lemma 11, we see that there is a positive divisor > S; linearly 
i 
equivalent to } Ni— |} Pi: + } Qi, and that it is generic over k(y,u) and 
rational over &k(z,y). But then m’ is linearly equivalent to m+ S:—> Ri, 
i 
which shows that, if we put 
z is generic on V over k(w) and that we have uw”—g(z,u). This shows 
that g satisfies (TG 2’). Applying Prop. 2 of AG, no. 3 and the main theorem 
of AG, we see that there is a group J, a normal law g between J and W, and 
a mapping ¢ of V into J such that g(z,u) =g(¢(z),u) for z, wu generic 


and independent over & on V, W. 
Put now K=k(P,,:--,P n=> Mi, and 


Since the P;, Q;, N; are generic and independent over &, Lemma 11 of VA-35 


| 
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shows that w is generic over K on V. At the same time, the linear equiva- 
lence by which the S; were defined shows at once that g(z,u) = g(w,u) 
for w generic over k(z,w), and therefore 9(¢(z),u) =g(¢(w),u). Since 
J, by definition, operates faithfully on W, this implies ¢(x) —¢(w) ; as w 
is generic over K on V, this shows that ¢(z) is generic over K on J. It will 
now be shown that K(¢(z))—K(n). In fact, if wu and w’ —g(z,u) are as 
before, uw’ is rational over K(m,n) —K(u,n) since the divisor mt’ is so by 
Lemma 11 of VA-35; therefore the mapping u—w’ is defined over K(n), 
so that K(¢(xz)) CK(n). Put now K’—K(¢(z)), so that wu’ is rational 
over K’(u); then m and mm’ are both rational over K’(u). But Lemma 11 
of VA-35 shows that n is rational over K(m,m’) and therefore over K’(u). 
If wu, is a generic point of W over K’(w), n will also be rational over K’(u;) ; 
as K’(u), K’(u,) are independent regular extensions of K’, this implies 
that n is rational over K’. 

But now a comparison with the construction of the jacobian variety 
given in VA-36 shows that the latter coincides with our J over a suitably 
extended groundfield; more precisely, substituting K for k, } P; for a, u for 


m and ¢(z) for z in the treatment given in VA-36, we get the same law of 
composition for the field K(¢(z)) as has been defined above. Alternatively, 
one may also reason as follows. Let J, be the jacobian variety as defined 
in VA; let ¢, be the “canonical mapping” of I into J,, also according to the 
definition of VA-37 (which will soon be replaced by a more appropriate one) ; 
let K, be an overfield of the field K defined above, over which J, and 4, 
are defined; take n generic over K,; put t= ¢(z), x being as above, and 
z=S[¢,(n)]. As we have then K,(x) = K,(z) = K,(n), the mapping rz 
defines a birational correspondence between J and J,, defined over K,. If 
we write it as z—=f(x), f is everywhere defined by VA-15, Th. 6; and this, 
by the results at the beginning of VA-19, must then be of the form 
f(x) =fo(z) +a, where a—f(e) and where fy is a homomorphism, so that 
(using the additive notation on J, and the multiplicative notation on J) we 
have fo(zz’) =fo(z) + f.(2’). But then f, is again a birational correspon- 
dence, and, if g is the inverse mapping to fo, we have g(z+ 2’) =g(z)g(2’) 
for z, 2’ generic and independent over K, on J,. This can be written as 
g(z) =9(2+2)g(2)7; if then z, is any point of J,, and we take z’ generic 
on J, over K,(2;), this shows that g is defined at z,. As fo, g are everywhere 
defined. they determine an isomorphism between J and J. 

One could also, without making use of the results of VA, verify directly 
(for instance by making use of the criterion for the completeness of a group 


| 
| 
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given by VA-33, Th. 16) that the group J we have constructed here is com- 
plete and is therefore an abelian variety. Then the results we have proved 
above, combined with the corollary of Th. 7, VA-18, show at once that J 
has the properties stated in VA-36, Th. 18; since the whole theory of the 
jacobian variety depends upon nothing else, and these properties (as proved 
in VA-37) are characteristic of the jacobian variety, this would suffice for a 
complete treatment. 

From this discussion, we conclude that J is an abelian variety. Now 
apply Prop. 6 to the mapping ¢ of V into J; this defines 2g mappings of T 
into principal homogeneous spaces with respect to J, all defined over k. 
As ¢ is symmetric in the N; and also in the Pj, the unicity assertion in 
Prop. 6 shows at once that the first g mappings must coincide, and that the 
last g mappings must coincide; call F, F’ these mappings, and H, H’ the 
spaces into which they map [. Now, notations being the same as above in 
no. 6, put 

a = (P,,- *,N,). 


Then we have, always with the same notations as before, u—g(z’,u’), and 
therefore ¢(2’) —¢(zr)-1. This, combined with the unicity assertion in 
Prop. 6, shows at once that H’ is the opposite space to H while F” must be 
the same as F’. 

We now embed J and H, in the manner explained at the end of no. 5, 
into a commutative group © consisting of principal homogeneous spaces H, 
with respect to J, all defined over k, with Hp J, H,—H, in such a way 
that G/J is an infinite cyclic group, that the H, are the cosets of J in 
and that the group-law in & induces on H» X Ho, for all m,n, a mapping 
of Hn X H, into Hm,» defined over & and satisfying the conditions in Prop. 5. 
At the same time, we change from the multiplicative to the additive notation, 
not only in J but also in &. With this notation, we have, if z, ¢(z) and F 
have the same meaning as before, 


Let us now extend the mapping F into a homomorphism of the group of 
divisors on T into G, by putting F(a) = > n,F(A,) for any divisor a= > njAj. 


so that F(a) eH, if n is the degree of a; in particular, F(a) is in J if and 
only if a is of degree 0. Ifa is any point of H and M is a generic point of I 
over k(a), the mapping M—F(M)—a of T into J, which is defined over 
k(a), is a “canonical mapping” in the sense of VA-37; naturally it is only 
defined up to an additive constant; and, by the unicity assertion in Prop. 6. 
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no such mapping can be defined over & unless H is isomorphic to G, i.e. unless 
H has a rational point over k. From Th. 19 of VA-38, one deduces imme- 
diately that a divisor a on T is linearly equivalent to 0 if and only if 
F(a) =0. 

In other words, the homomorphism a— F(a) determines an isomorphism 
of the group of all divisor-classes (of any degree) on T onto the group ©. 
From the foregoing results, one concludes easily that these properties are 
characteristic for © and F, up to isomorphisms on J and its cosets H, in ©. 
One may call © the Jacobian group of T, and F the canonical mapping of TI, 
and of the group of divisors on I, into the Jacobian group. In substance, the 
construction of the varieties H, has already been given by Chow (in [1]) 
by a method belonging to projective geometry. 
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A PROOF OF A THEOREM OF MEYER ON INDEFINITE 
TERNARY QUADRATIC FORMS.* 


By Burton W. Jones and DonaLp MarsH. 


1. Introduction. We shall be considering in this paper primitive 


3 
indefinite ternary quadratic forms f = > aj;jv;z; where the aj; are integers with 
4,j=1 
g.c.d.1. We call J the g.c.d. of the two-by-two minor determinants of (4aj;) 


and det(ai;) —J?K defines an invariant K. (Classically these invariants 
were denoted by 2 and A). The purpose of this paper is to give a proof of 
the following theorem of A. Meyer [4] which is very fundamental in the 
theory of quadratic forms. 


THrorEM A. If two primitive indefinite ternary quadratic forms are 
in the same genus and if their invariants J and K have g.c.d 1 or 2 and 
neither is divisible by 4, then the two forms are equivalent. 


Meyer gave what he called a proof of this theorem but which contained 
various obscurities which neither L. E. Dickson [1] nor the authors have 
been able to resolve. Bachmann did little to clarify these obscurities. L. E. 
Dickson gave a proof of Meyer’s theorem (cf. [1], pp. 35-60) which was very 
involved in detail and had a few obscurities of its own, though it was an 
improvement over Meyer’s discussion. Oral communication with Martin 


Kichler and Martin Kneser indicates that this theorem may also be proved 
using the deep theory of “spinor genera” developed by the former [2]. 


In this paper, by free use of matrices and the modern theory of quadratic 


forms including the Hasse symbol, we present a proof which involves a 


minimum of detailed computation and which we hope is largely free from 
obscurities. 

The outline of the proof is as follows: Theorems 1 and 2 show that two 
ternary forms are equivalent if they have the same invariants J and K, if 
they are both properly or both improperly primitive and if they represent a 
binary form with certain properties. Theorem 3 gives two conditions under 
which a ternary form f represent all binary forms of determinant JM where 


* Received February 11, 1955. 
* This was written under the support of the Office of Naval Research. 
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M is a prime or double a prime according as the reciprocal form of f is 
properly or improperly primitive. Theorems 4, 5, 6, and 7 establish the 
satisfaction of the first of these conditions and Theorem 8 the second. The 
remainder of the proof is concerned with showing that any two forms of the 
same genus represent a binary form of determinant JM where M has the 


required properties. 


TueoreM 1. Let B=(b,,) be the matrix of a properly or improperly 
primitive binary form, h, of determinant JM and let K be an integer such 
that g = (M,K) is square-free and prime to both M/g and K/g; then there 
exists a ternary form f with invariants J and K which represents h primitively 


if and only if there are integers c, and cz such that 
(1 ) b,,K, = booK, (mod M ). 


Proof. Let A be the matrix of the form f. The f represents h primitively 
if and only if there is a 3 by 2 integral matrix P whose two-rowed minor 
determinants have g.c.d. 1 and for which P?7AP=B. Then there is an 
integral column matrix Q such that U = (P,Q) is a unimodular matrix and 


UTAU =E= (bij), j= 1, 2, 3. 


That is, f represents h if and only if it is equivalent to a form of matrix / 
whose leading two-by-two minor is B. 

First suppose f with invariants J and K represent h, where JM is the 
determinant of h and (bi) is the matrix of h,i,7—=1,2. Then write 
= (Bi) where and B,=0(modJ) for all 1,7. Now 
adj(adj =J*KE. Hence 


(2) Bo2B33 — B*23 =JI*K bu, 
(3) B,, — B*,3 = bo, 
(4) By2Bs3 — By o3 = — J? K by, 


which show that we may take c,;—B.;/J, co—=B,3;/J and have the con- 
gruences (1). 

Conversely, if the congruences (1) hold equations (2), (3), (4) with 
¢, = B.3/J, C= B,;/J are solvable for B,, and B,.. Thus the matrix 
(Bi) exists. Suppose the elements B;;/J had a common prime factor q. 
Then g? would divide Kb,;, Kbo., Kb,. and since (0j;, b22, 612) =1, would 
divide K. But B;;—JM and hence g|M. However g would not be prime 
to (M,K) unless gq? divides M which is impossible since (M,K) is square- 


free. Thus 1 is the g.c.d. of B;;/J. A little calculation shows that /*K° is 
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the determinant of (Bij). Then adj(B,)/J*K is equal to a matrix H whose 
leading two-by-two minor is B and which has the invariants J and K. 

Let us henceforth take (J, K) =2, (mod 4), and let w 
or 2 as f is properly or improperly primitive. Notice that if w=2, K= 


(mod 4) and J is odd. 


6 


THEOREM 2. Two ternary quadratic forms of the same invariants, J, 
K and w, are equivalent if they both represent primitively the same primitive 
binary form of determinant JM in which M is an odd prime or twice an odd 
prime such that (M,K) =g and (g,K/g) =1. 


Proof. For M a prime or twice a prime and (g, K/g) =1, the preceding 
theorem holds and the congruential conditions (1) have only the pair of solu- 


tions ¢,, ¢. and the corresponding —c,, —cs, modulo M. The corresponding 

ternary forms obtained are related by the transformation{ 0 1 0 }, whereas 
0 0—1 


the form obtained by using c,+tM, c.-+ uM with t, uw integers in place 
of c,, ¢, is related to the latter by means of the unimodular transformation 


1 0 —wu 
0 O 1 


We now seek to determine conditions under which the hypothesis of 


Theorem 2 holds. 


2. THroREM 3. If a ternary form f represents a primitive binary 
form —h of determinant JM where M is a prime or double a prime according 
as the reciprocal form of f is properly or improperly primitive, and if h’ is 
another binary form of determinant JM, then f represents —h’ if there are 
two primes p and q not dividing 2M |f | such that 


(i) Jf T, U is the least solution of x? —wpqKy? =1, then TA+1 
(mod wpq). 


(ii) The equations h=wpq, h’=wpq are both solvable. 


Proof. Consider h and h’ as having been transformed so that their 
leading coefficients are wpq and let B=(b,,) be the matrix of A with 
b:;=wpq. Since p, q do not divide M, b,. is prime to 6,, and is one of 
the four roots of z?7==—JM (modwpq). If we show that some unimodular 
transformation of f leaves b,, and | B| fixed, but replaces bie by bi.’ 4+ bie 


(i). 
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(mod wpq), then f represents —/h’ since the “b,.” for h’ is one of 6i2, 5,2’, 
— 

Write the matrix of f in the form (a,s;) where a,,=—},;, consider the 
reciprocal form and denote 


by R. The determinant of R is —b,,K. If P is an automorph of R, then 


the transformation * >) on F is equivalent to the transformation( j ral 


on f (leaving b,, and 0,,K invariant). P! is of the form 


Aootl t Asstt 


where f, w is a solution of z*—wpqKy*=1. Now 
t— 
— 


= dyot — + + =Aret (mod wpq). 


Thus },.’==b,.t (modwpq) and, if t4+1(modwpq), then 61.’ 
(mod wpq). 

We now find the conditions which must be imposed on p and q if (i) 
and (ii) of Theorem 3 are to hold. First in the next four theorems we show 
that if the product pq satisfies certain conditions (mod 8) and (mod K), and 


if for some fixed factor d, of K, p satisfies the condition (—d,|p) =—1, 
then 
(5) 2? —aKy’? =1, a= 


has a minimum solution T, U with (mod wpq). 


THEOREM 4. If K =2°G*H where GH is odd, H square-free, (a, K) =2 
and a is odd or double an odd according as w=1 or w=2, and T? —aKU’? 
=1 with T=+1(moda), then for some factorization dd’ of H with d, 
d’ relatively prime and positive, the following equation has a solution where 
cc’ is the highest power of 2 in 2°U? and not both c, c’ are divisible by 4: 


(6) dcx? — d’c'ay? = + 2. 


Proof. Equation (5) implies (7 and 
T=-+1(moda) implies 


(7) T = 1—ad'e'r,’, T + 1 = der,’, (71,2) = 1, 


Ars Az: 

Az; 

\ 

( 

h 
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d 
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d’ 

an 
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since 7’ = 1 and T +1 have at most a factor 2 in common, where r, and r. 
are odd, = and dd’=TI//. Subtracting the first equation of 


(7) from the second, we have 
(8) (T+1)—(T=1) = +2 —der,?—ad’e’r,’, 
which completes the proof. 


Since equations (6) differ according to the parity of e, we specialize 


these results to find 


(1) If e—0, ¢ and c’ may both be 1 or one is 2 and the other an odd 
power of 2 greater than the first; in the former case the equation (6) becomes 


(6,) dx? — d’ay” = + 2. 
If c and c’ are both even, we have 


(6.) dx? — = +1 
where 2 or y is even. 


, 


(2) Ife—1, one of c, c’ is 2 and the other is an even power of 2. Then 


(8) becomes + 1 or +2—d(Vc)?r.2 — 2ad’r,’; 


hence we have one of 
(6, ) dz? — 2ad’y? = + 1, (64) da? — 2ad’y? = + 2. 


We show that by proper choice of a modulo the primes dividing G but 
not H we may exclude from consideration equations (6.) and (6;) with 
d=1, and Note that d—1 implies d—H. 


TuHrorEM 5. If, according as e is 0 or 1, a@ or 2a ts so chosen that 
(— all | pi) =—1 or (— 2aH | p;) =—1 for all primes p; dividing G but 
not H, then, if T, U is the least solution of (5), it follows that if T=+1 
(moda) then (6) does not reduce to either of the following in the respective 
CAUSES 


(9) —aHy? =1, — 2aHy? = 1. 


Proof. Suppose for the least solution T, U equations (7) with d—1. 
l’—=H define r., 7, a solution of (6) or (6;) with d=1 and that 
is, a solution of one of the equations (9). First we take the case r, = (ir 
and show that 7, U is not the least solution of (5). For e=0, 2 —r.(4c)?. 
Yo=1,(4c’)2 in (9) gives ro? —aG?H (yo/G)? =13 xo, yo/G is a solution of 
(5) in which 7 =cr,?—1 and r,—r.V ($c) < T which vields our contra- 
diction. 
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If e=1, r, = Gr implies that yo= ($V is a solution of (9) 
with 2 < T. 

Second, suppose that 7,40 modG@), thus 17,=0(modG@) implies 
that r. contains a prime factor of G and (6,) and (6.) with d=1. d’=H 
implies that the prime factors of G which r, contains cannot divide H. For 
(9) to have no solutions with x divisible by a prime factor of G@ not dividing 
H it is therefore sufficient that (---aH/|p;) =—1 or (—2aH|p;) =—1 in 
the respective cases for all primes p; dividing @ but not H. This completes 


the proof. 


We now consider equations 
(10) g = dx? — bd’'y? = N, 


where ba or 2a according as e=0 or 1, with aw prime to 2H and 
N is prime to H. In order that (10) be solvable. it is necessary that 
Cyp.(g) = (—N,bH),, for all primes p;, where cp)(g) is the Hasse symbol 
(see [3], p. 47). We now choose 6 (mod #) so that. for any fixed N prime 
to H, there is one and only one value of d such that c,,(g) = (—N.bH), 


for all primes p; dividing /. 


THEOREM 6. Let py. be the prime factors of H. We can 
choose b (mod p;) 1=1,2,- + -.n so that, for any N prime to H, there will 


be one and only one d dividing H such that 
(11) Cp,(9) = (—N, ),, 
for all primes p; dividing H. 


Proof. First compute c)(g) and find it equal to (—1,bH),(d.b1), 
= (—d.bH),. Suppose (—d,,bH), = (—d.,bH), for all such p. This 
implies and is implied by (d,d.,bH), 1 for such p. Hence our theorem 
will be proved if we can show that, for a proper choice of b, (d,bH),=1 
for all p dividing H holds for one and only one value of d (namely d= 1). 

In order to prove this by induction, assume first that H is a prime: 


then the two possible values of d are 1 and p. Since (1,bp),—1, we want 


to choose 6 so that (p,bp)p,——1. Now (p,bp),= (p,—b)p= ). 
Hence choose b so that the latter is —1. 
Assume the theorem for H containing prime factors p,, pn 


and that b has been chosen so that (d,bH),—1 for all primes dividing H 
only when d= 1. In order to establish the theorem for p,H choose b’ so that 
(b’pn| pi) (b| pi) for 1=1,2.: - -.2»—1. We must show that for d+! 


neither of the following is 1 for pj = p,, Pn: (d, b’'Hpn)p,, b’H pn)p: 
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First. (d, b’H pn)p; = (d, bH)», for 1=1,-2,- --,n—1 and since the latter 
is —1 for some i, the former is also. Second, (dpny, b’H pn)p, = (dpn, 0H)», 


for i=1,2,- - -,n—1 and the latter symbol is equal to (d,bH)p,(pn, 6H)»; 
thus. if (dpp. b’H pn)», = 1 for i= 1, 2,- - -,n we have (d,bH)p, = (pn, 0H)», 
for i= 1.2.: -.n—1. Thus, by the hypothesis of our induction, there 


is one and just one d, i.e.. dy, which has this property. Then, choose 0’ 
(mod p,) so that (dopn, b’H pn)», =—1. This is possible since (dyp,, b’H pn)», 
= (dy. b’H py) p, (pn. pn) p, = pn) (—v’H | Pn). Since for d=1, 
— (—1.)//), for all primes dividing H, the above shows that we need not 
consider (10) with V=1 for dA1. Theorem 5 shows that proper choice 
of « (mod G) also excludes VN =1, d=1. This completes the proof and we 


complete the exclusion of (10) by the following theorem. 


THEOREM 7. There exist odd numbers a, B incongruent (mod4), a 
divisor dy of HT, and a choice of sign + so that (10) with N =—1,2,—2 


have no solutions for the following conditions imposed upon «a: 
(1) Jf e=0,. al =1 (mod 4) or aH =5 +2 (mod 8). 
(2) Jf e=1=—w. aH =z or B (mod 8), 


(3) a=wpy where p and q are distinct primes nol dividing 2H and 
(— d,|p) — 


Proof. If e=0, we may assume from the previous theorem that a is 
chosen (mod A’) so that for all p; dividing H each of the following three 
equations is solvable in R(p;) for only one divisor d of H: 


(12) —a(H /d)y? = —1, —a(H/d)y? = + 2, dx? —a(H/d)y* =— 2; 
that is. 
(13) (—d. all) p; = 1, (2d,aH) p; = 1, (— 2d, aH) p;=1 for all p; dividing 


Let the respective values of d be d,, dz, ds. First, if aH =1 (mod 4), 
the last two equations of (12) have no solution and we may exclude the 
first by choosing p so that (—d,,aH), = (—d,|p) =—1. Second, consider 
aH == 3(mod 4). Now (—d,, aH)», = (2d2,aH),,=1 implies (— 2d,d., 
= 1 for all p; dividing H. Since there is just one divisor d, of H such that 
(— 2d;,aH),,—=1 for all p; dividing H, did, is, except for a square factor, 
equal to d;. Similarly. any product of two of d,. ds, d, is, except for a square 
factor, equal to the third. This implies that at least one of d,, ds, ds is con- 
sruent to 1 (mod4). Suppose —-d,==—1 (mod 4), then aH =3 (mod 4) 
implies (—-d,,aH).——-1 and one of (2d..aH).. (—2d,.aH), is —1. Sup- 
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pose that it is the former. Then choose p so (— 2d3, aH)» = (— 2d;| p) == —1. 

If e==-1—w’, we have the same set of equations except that a is replaced 
by 2a. There are two odd numbers @ such that (2d2, 2a). = (2d2, 2)o(2dz, «). 
—-—1 and the two are incongruent (mod4). Make aH =a (mod8) and 
see that two of the equalities in (13) are denied for pj? and the third 
may be denied for pj=p. 

If e=1, w=2 equation (10) becomes N = dx? — 4Apqd’y? which is not 
solvable for V = 2 or —2. The other equation is excluded by (— dy|p) =— | 
where d, is the value of d for NV =— 1. 

Theorems 5, 6 and 7 list restrictions imposed on p and q (mod8h’) 
which assure the satisfaction of condition (i) of Theorem 3. We now 


consider condition (ii) of Theorem 3. 


Trrorem 8. Let h and h’ be binary forms of the same primitive genus 
and of determinant JM ; let dy be some integer prime to 2JM and (JM, K) 32: 
let h represent an integer dy satisfying the conditions (mod 16K) imposed by 
Theorems 5, 6 and conditions 1 and 2 of Theorem 7%. Then there are two 
primes p and q such that (—d,|p) =—1, wpq is represented by h and I’, 
and wpqg=<d> (mod16K). 


Proof. We carry through the proof for h properly primitive. If h and 
h’ are improperly primitive the same proof holds if we replace h and h’ by 
th and 4h’. This of course replaces JM by JM. 

The form hh’ derived by composition from h and h’ is in the principal 
genus and hence arises by duplication (cf. [3], p. 167) ; hence hh’ = v? and, 
denoting the class vh’-! by u, we have h’=vu-'. Since d, is an odd factor 
of K and J. is the determinant of v with (K,JM) S 2, there is an integer p’ 
prime to 2|f | / such that (—d,|p’) =—1 and p’ is compatible with the 
characters of v. For any a compatible with the characters of h and prime 
to JM we can find an integer q’ prime to 2|f| M so that p’q’=a (mod m) 
for any preassigned m prime to p’. Then v represents infinitely many primes 
p=p’ (mod8/f|M) and thus and infinitely many primes 
(mod8|f!M). Let p. q be one such pair and have k= pq. h’ = pq both 
solvable. 

The preceding theorems combine to establish the following result. 


THEOREM 9. If an indefinite primitive ternary form f represents primi- 
tively a primitive form —h of determinant JM with (JM, K) S2, then it 
represents every binary form in the same genus as —h provided 


(i) M 1s an odd prime or twice a prime according as the reciprocal 
form of f is properly or improperly primitive. 


bo 


? 
a. 
3. f 
4. 
6. J 
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(ii) A represents an integer u satisfying the conditions (mod 8K) of 
Theorems 5, 6 and 7. 

Then in view of this theorem and Theorem 2, the proof of Meyer’s 
theorem will be completed if we establish 

THEOREM 10. Jf f and f’ are two indefinite primitive ternary forms of 
the same genus, if (J, K) S2 and neither J nor K is divisible by 4, then 
they represent, respectively, two primitive binary forms —h and —h’ having 
the following properties: 

(i) h and h’ are the same genus and have determinant JM, 


(ii) J is a prime not dividing 2JK or twice such a prime according 
as the reciprocal form of f is properly or improperly primitive, 


(iii) h represents an integer a satisfying conditions 1 and 2 of 


Theorem 7. 


Theorems 5 and 6 need not be considered here since (JM,K) S2, 
implies that 4 represents integers =a (mod p) for any odd prime p dividing 
K. 


The proof of this theorem requires two lemmas. 


LemMMA 1. Let the invariants, J, K and w, of a primitive indefinite 
lernary form f be such that (J, K) S32, JA0KK (mod4) and w—1 or 
2 according as the form f ts properly or improperly primitive and, except for 
the factor 2, M is prime to JK. Then f represents a binary form —h such 
that h==a ts solvable for a satisfying conditions 1 and 2 of Theorem 7% 
if M satisfies the following conditions exhibited in tabular form, where p. p. 


means properly primilive and 1. p. means improperly primitive. 


Numbers represented 


Invariants Conditions on M (mod 8) by h 
1. fp.p. J odd, K even M =—J (mod 4) and arbitrary 
e.(f) = (2|—JM) 
2. F p.p., K odd, J even arbitrary a = H (mod 4) and c,(f) =(2|@H) 
JM = = 2c.(f) (mod 8) aH =5=+ 2 (mod 8) 
3. fi.p., J odd, K even arbitrary a = 2 or 6 (mod 8) 
4. Fi.p., K odd, J even M =d (mod 8) a = K (mod 4) 
5. JK odd c.(f) = 1 and M = 3J (mod8) arbitrary 
c.(f) =—land M =/J(mod4) a =—H (mod 4) 
6. J = K =2 (mod 4) ¢,(f) = (2a,—JM), @ = aH (mod 8) 
e,(f) = (28, —JM), a = BH (mod 8) 


(« and f defined in Theorem 7) 


= 
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Proof. Since there are no requirements imposed by Theorem 7 for / 


improperly primitive. we need consider only properly primitive forms f in 
this proof. First we show that under the conditions imposed, f represents 
a primitive binary form —h of determinant JM. Let g be the greatest od 
factor common to all the coefficients of h. Then g?|J and, since J and J/ 
have no odd factors in common, we may by an equivalence transformation 
on F have 

F=cJ Kx? -- bky? + Mz? (mod /,’). 


where J, is the greatest odd factor of J and bc is prime to J). Then 
+ cJMy* + beJ Kz? (mod J,”), 


and the leading binary of f has determinant JM and leading coefficient 
congruent (mod g*) to 6M which is therefore prime to g. Thus the coefli- 
cients of this leading binary form have no odd prime factor in common. 
This shows that h may be taken to be primitive. 

To show that h is properly primitive notice first that it can be im- 
properly primitive only if J1/=3 (mod4) or 4(mod8). We postpone the 
latter and consider the former. If, under this condition, it is improperly 
primitive, it can be considered to have a leading element — 2s ==2 (mod 4). 
Then 


(14) ce(f) (— 28K, —JM),. 


If A is odd, then by the conditions imposed on MV either c.(f) =1 in which 
case JM ==3 (mod 8) implies (— or ¢.(f) =—1 
and JM==1(mod4). Thus, for A odd, condition 5 of the lemma implies 
that f represents no improperly primitive forms of determinant JM. Ii 
kK ==2(mod4) and f represents an improperly primitive binary h of deter- 
minant J.V/, then f may be taken congruent to h + czx,? (mod 2") with c even, 
which denies f properly primitive. It remains to consider the numbers 
which h represents and (mod 8). 

If J.540 (mod 4), the numbers (mod8) which h represents will be 
determined by its Hasse symbol with p=2. Furthermore h represents a 
(mod 8) if and only if c.(f) = (aK.—JM),.. Now consider the conditions 


in order. 


1. Here M==—J (mod 4) shows that ¢.(f) = (2]—JM) and all odds 
(mod 8) are represented by h. 


2. Here the numbers a represented by h are given by c.(f) = (aH, — JM):. 
Thus if aH =1 (mod 4), a is represented if and only if c.(f) = (aH, — JM): 
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regardless of the choice of M. If aH =5 + 2 (mod 8) the choice 


of (mod 4) specified suffices. 
3. If f is improperly primitive, Theorem 7 imposes no restrictions. 


4. If =J(mod8), h represents numbers congruent to K (mod 4) 


and is properly primitive, for F may be taken congruent to 
g + 2cx,” (mod 2’), 


where g is an improperly primitive binary form. Af is then congruent to 


2g, + ex,” (mod 2”), where e==—1 (mod 4), and g, is an improperly primi- 


tive binary form. However g, cannot have determinant J.W/4 since the 
condition imposed requires J.//4==1(mod4). This shows that f does not 
represent an improperly primitive binary form of determinant J.W//4. On the 
other hand, the odd numbers represented by Af are congruent to e, that is. 
congruent to —1(mod4). This shows that the odd numbers represented 
by h are congruent to KW (mod 4). 


5. If e.(f) =1 and JM==3 (mod 8), then c.(f) = (aH,—JM), for a 
arbitrary. But c.(f) =—1 and J. =1 (mod 4) shows that — 1 = (aH, —1). 


and hence aH =-—1 (mod 4). 


6. Here a==a2H (mod8) if and only if ¢.(f) =(2a.—JM),. This 


completes the proof. 


Our final step in the proof of Theorem 10 and hence of Meyer’s 


theorem is 


LemMMA 2. Jf the invariants J, K and w satisfy the conditions of the 
theorem and if f and f’ are indefinite forms of the same primitive genus 
with these invariants, then F and F’, their reciprocal forms, represent a prime 


M or twice such a prime satisfying the conditions imposed on M in Lemma 1. 


Proof. Since the classes f and —f are class isomorphic, for this lemma 
we take J and K to be negative integers. This makes the binary h and the 
binary g, determined below, indefinite forms and therefore capable of repre- 
senting both positive and negative integers. 

If the form f represents wq where q is a prime not dividing 2! f| then, 
by Theorem 9 with f and F interchanged, F represents a binary form —g 


of determinant gA and all binaries in the genus of —g and hence all 
numbers —-r, where r is consistent with the generic characters of g, pro- 


vided conditions 1 and 2 of Theorem 7% hold after replacing a, J and K by 
—r, K and J respectively, that is 
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1’) If J is odd, —rJ=1(mod4) or —r/ congruent to a preassigned 
one of 3, % (mod 8). 


2’) If J is even, F properly primitive, —rJ/2 ==, or B,(mod8) for 


z, and B, preassigned with «,8,;==— 1(mod 4). 


By Lemma 1 with f and /’, « and —r interchanged, the representation 
of such an r by F will be assured by the following choices of wq exhibited in 
tabular form. (Note that co(f) =c2(F).) 


Invariants Conditions on qg Conditions (mod 8) on r 
f p.p., J odd, K even arbitrary -r = J(mod 4) and 
Ca(f) = (2| 
Kq = = 2c.(f) (mod 8) —rJ =5+2 (mod 8) 
F p.p., J even, A odd q = —H (mod 4) and arbitrary 
c2(f) = (2] —qH) 
fi.p., J odd, K even q = H (mod 4) —rz=ddJ (mod 4) 
Fi. p., J even, K odd arbitrary r = 2 or 6 (mod 8) 
Jh odd c.(f) = l and q= 3H (mod 8) arbitrary 
co(f) =—landg = H (mod 4) —r=—VJ (mod 4) 
J = K = 2 (mod 4) = (Ze,, r = a;//2 (mod 8) 
c.(f) = (28,,— Kq). = B,J /2 (mod 8) 


First, to show that such a choice of wy is possible, notice (|3], p. 87), 
f properly primitive implies that f represents all odds (mod8) unless K is 
odd and In this case the only odds excluded (mod 8) are 
those numbers b for which bH =—¥1(mod8). Inspection of the table shows 
that choice of g may be made consistent with this exclusion. Similarly if f 
is improperly primitive 29 is excluded (mod 8) only when c.(f) —=—1 and 
qH =—1(mod8), where the corresponding entry in the table is =H 
(mod 4). 

Next, comparison of the listing for r in the above table with the require- 
ments on M imposed by Lemma 1 shows that they are consistent except for 
the last case which needs further consideration. The only possibility of 
trouble for this case is when none of 


c2(f) = (2a,—JM)., c2(f) = JM), 
hold for M —a,J/2 or —B,J/2 (mod 8) ; that is, all of 


(2a, + 2a, )>, (2a, + (2B, + 2a,)>, (28, + 
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are equal to —cz(f). This is impossible since their product is equal to 


(#8, %,8:)2 which has the value —1. 

Thus we have shown that F’ represents a prime M, or double such a 
prime consistent with the requirements of Lemma 1. In similar fashion f’ 
may be shown to represent a prime q’ congruent to qg for an arbitrary modulus. 
Then F’ represents a binary form —g’ which represents a prime —M, con- 
gruent to M, (mod8K’) since the determinant of g’ has at most a factor of 
2 in common with A. Then a prime (modq) and (mod q’) 
as well as congruent to M, (mod 8K’) will be represented by both F and F’. 
This completes the proof. 
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ON THE BEHAVIOR OF INVARIANT CURVES NEAR A HYPER- 
BOLIC POINT OF A SURFACE TRANSFORMATION.* 


By SHLOMO STERNBERG. 


1. In |2], Poincaré considered surface transformations of the type 
(1) f(x,y), 


where s and ¢ are real with s > 1 >¢> 0, and where f and g are real analytic 
functions of x and y which vanish at the origin together with their first 
derivatives. He proved, by the majorant method, that there exist two analytic 
invariant curves each tangent to one of the axes. In the case of dynamics, 
the relation st 1 holds (as a necessary condition for the transformation to 
be area preserving). We shall not make use of this condition. Hadamard 
[1], treating the case where f and g are merely assumed to be of the class C* 
(and to vanish at the origin along with their first derivatives), showed the 
existence of invariant curves by the method of successive approximations. 
In paragraphs 2-6 we shall treat questions of existence and uniqueness of 
invariant curves by a geometric method under slightly less restrictive con- 
ditions and show that our results are, in a sense, the best possible. In 
paragraphs 7 and 8 we shall deal with questions of smoothness of invariant 
curves and show that a C” assumption on the non-linear terms implies that 
the invariant curves are of class C"” (n21). The problems treated in this 


paper were suggested to the author by Professor Wintner. 


2. Let 7 be a transformation of type (1) which is topological in some 
neighborhood of the origin. Let s>1>¢> 0, and let the functions f. g be 
continuous and o(r) as r= (a*?+ y*)3—0. It is then clear that for all 
points sufficiently close to the origin, the y coordinate is decreased and the z 
coordinate is increased upon application of 7. Thus if an are passing through 
the origin is to be invariant, it must be tangent to one of the axes or else 
experience oscillations of increasing amplitude and frequency near the origin. 
In the latter case we shall, naturally, not speak of an “invariant curve.” 
Hence, either the arc is tangent to the x-axis at the origin, in which case 
all points of the are move in toward the origin upon application of 7; or 
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the are is ene to the y-axis, in which case all the points move away from 


the ori gil 


3. In this paragraph and the following, S and 7 will denote trans- 
formations of type (1) where é>1.¢>s and s>1, respective’ 


THEOREM 1. Let S be a transformation of type (1) where 0<s <1. 
s <tand where f, g are continuous and are o(r) as r= (a? + 30. Then 
there exist a neighborhood N of the origin and a closed set EK in N_ such 
that 1) E is invariant under S, 2) for every abscissa x in N there exists a 
point (x,y) of HE, 3) E ws tangent to the x-axis of the origin (1.e., given any 
«> 0, for all sufficiently small |x|, all points (z,y) of F# lie in the angular 
region |y|Se|z]). 

Proof. We can restrict our attention to the right half plane «= 0 since 
the argument is the same for the left half plane. Consider a point (2, y) 
on the line yar, >0. Its image will be the point (2,,y,), where 
Thus 


= {at/s + g(a, y)/sx}/{1 + f(a, y) 


Since y= ax and both f and g are o(r), we find that 7, < x and y, > a2, for 
all sufficiently small positive z, say, for 0< 7S. 
In other words, a point on the line y = az is moved to the left and above 


the line by S for all e< a. Similarly, a point on the line y=— az is 
moved to the left and below the line. For all « < 2, let E°(x) denote the 
vertical line segment joining the points (#,— and (a,ar). Now will 


map H°(x) into a set containing a connected are joining a point above the 
wedge |y| az to a point below the wedge. Let H1(x) denote the subset 
of E°(z) consisting of those points whose images under S lie in the wedge 
Sar. If - -,£"*(x) have been defined, let H"(x) denote 
the subset of #”-1(x) consisting of those points whose images under S” lie 
in the wedge. Since, by induction, the image of #"-"(x) under 8” contains 
a connected are joining a point above the wedge to a point below it, H"() 
is not empty. E(x) is clearly closed for all m. Hence E(x) =f) E"(z) 


is a non-empty closed subset of H°(x). It is clear that given any e we can 
find a positive number € such that for all 0< «Sé and all kZe, S takes a 
point on the line y=—kze into a point above the line y= (k + r)a where 7 is 
independent of « and k. Hence for a point (z,y) with eSé to lie in HL 
we must have |y| <ex. Thus H(z) is tangent to the z-axis at the origin. 
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Let E=|J) E(x). Then # is clearly invariant under S. In fact, any 
Zz 


subset of the wedge invariant under S is eventually contained in #. Further- 
more, a point P = (a, y) of the closed wedge | y| S az, 0 is not in 
only if S”, for some positive n, carries P outside the closed wedge. But then 
a small neighborhood about P is carried outside the wedge by S". Thus the 
complement of £ is open, so that F is closed. This proves 1), 2) and 3). 


THEOREM 2. Let T be a transformation of type (1) where s>1, s>t 
and f and g are continuous and o(r) as r—>0. Then there exist a neighbor- 


hood N of the origin and a closed set E satisfying 1). 2). and 3) of Theorem 


1 where, by invariance, we mean T(E)AN=E. 


temark. If T is topological, then Theorem 2 can be obtained by 
applying Theorem 1 to S=T™". 

In order to prove Theorem 2, let us consider an angular wedge W: 
'y| kx. Then from arguments similar to the above we have 


(T(W) AN) C (WAN). 
Let 
F,=T(WAN)ON 


and in general F, =7(F,.) ON. It is clear, by induction, that F’,.. C Fy 
and thus H=/[)F, is a non-empty closed set. It is clear from argumenis 


n 


similar to those of the preceding theorem that 1), 2) and 3) are satisfied. 


4. We shall now show that, under the hypotheses of Theorem 1, H(z) 
can contain more than one point. S will be a transformation of type (1) 
in which f==0. We shall choose g such that g(2z,0) —0 and so that the z-axis 
is an invariant curve. 

Let y=w(z) be a smooth curve tangent to the z-axis at the origin and 
such that y(z) >0 for z>0. In order that y(z) be an invariant curve, 
it is necessary and sufficient that the functional equation 


(2) ty (x) + =y(sz) 
be satisfied. Rewriting this we obtain the condition on g 
(2”) =y (sz) —ty(z). 


We now define g as follows: Let g(z,y) be 0, {y(sx) —ty(zx) }y/p(ax) or 
y(sx) according as y=0, OS ySy(z) or yZzy(z). We must 
now verify that the mapping S, with f==0 and g defined as above, satisfies 
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the conditions of Theorem 1. (It is clear that y can be so chosen that the 
mapping is even topological.) Now g(a, y)| Sy(se) +ty(x). Thus g is 
o(|z|) and so a fortiori o(r). 

We have constructed g so that (2’) is satisfied; hence, both y—y(z) and 


y= 0 are invariant curves of S. 


5. We now consider the question of uniqueness. 


TueorEeM 3. If the hypotheses of Theorem 1 are satisfied, and tf both 
f and g satisfy uniform Lipschitz conditions with respect to x and y, where 
in a neighborhood of the origin, the uniform Lipschitz constants are less than 
8<4(t—s), then, in a suitably small neighborhood N, E is a curve y=y(r). 
In other words. there exists a unique invariant are in any wedge |y| Sk |r| 


for sufficiently small | a |. 


In order to prove this theorem it will suffice to show that given any 
wedge | y|< kr there exists a positive number € such that there cannot be 
two points (x,y) and where and y>~y’, whose images 
under S” are in |y| kx for all non-negative n. 

Since f is o(r), any point in |y| <r with sufficiently small abscissa 
«(>0) is mapped into a point whose abscissa is smaller than (s+ )"x. 
Thus if (tn, Yn) and y’n) denote the images under S” of (2,y), (x, y’), 
we must have 2p, 2’, << (s+ and consequently, | | < 2k(s + 
in order that these images all lie in |y| <hr. We shall show that this is 
impossible for all suitably small .. 

From (1), = — + Yn) — Thus, 
for all sufficiently small .r, 


IA 


Similarly, yn — y/n = — + Yn) — 80 that 

Now assume that 

(3) | 1 | | Yn | ; 

then we have 


< (s + 28)| Yn-1 and | Yn — | > — Yn-1 —¥'s 1 . 


Since 8< 4(t—s) we obtain | << | yn—y'n|. As | 
’ =0<|/4%—y’,|, the inequality (3) is true for all n, by induction. We thus 
obtain 


| yn —y'n| > (L—28)| OF | > |. 


| 
i 
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We must therefore have (t—28)"|y—y’| < 2k(s-+)"*, which is clearly 


impossible for all n if « is chosen to be less than 3(t—s). Thus y=y and 


the theorem is established. 


6. Let us now consider a transformation 7’ satisfying the hypotheses 
of Theorem 2. The set HZ, in virtue of its construction, contains any invariant 
set lying in the intersection of any angular wedge | y|=k« and some suffi- 
ciently small neighborhood of the origin. Now let G be any set lying entirely 
in such a region. It follows, from reasoning similar to that of paragraph 3. 
that the image of this set under 7, G,, still lies in the angular region 
Let Go, be the sequence of sets so obtained. Let 
C be the set of limit points of this sequence. C is obviously an invariant set 
and hence a subset of FE. In particular, if / reduces to a curve, then all the 
successive approximations converge to the invariant curve. Let us now assume 
that both f and g satisfy uniform Lipschitz conditions (with respect to r 
and y) where the Lipschitz constant can be made arbitrarily small by 
restricting ourselves to a small neighborhood of the origin. Let y= (~) 
be a curve satisfying a uniform Lipschitz condition with Lipschitz constant K. 
Thus |(y—y’)/(c—2’)| < K. Using the same notation as before, we obtain 


y's) 
= {t(y—y’) + y) — — 2’) + f(a, y) — f(a’, y’)}. 


so that |(y,—y’,)/(a,—2’,)| < + (14+ K)8}/{s—(1+ K)8}, where 5 
can be made arbitrarily small by choosing x and «’ sufficiently small. Thus 
in a sufficiently small region about the origin, all the image curves satisfy 
a Lipschitz condition with Lipschitz constants all smaller than Kk. Hence 
the sequence is equi-continuous and the convergence is uniform. This is 
essentially the case considered by Hadamard [1]. We have thus proved 


THEOREM 4. Let T be a transformation of type (1) where s>1, s>1. 
f and g are continuous and o(r). Furthermore let the set E of Theorem 2 
be a curve. Then the sequence of successive images of a set situated in an 
angular region |y| <kx converge for sufficiently small x. If. in addition, 
f and g satisfy uniform Lipschitz conditions with Lipschitz constants which 
are 0(1) as r—>0, then the iterates of any curve y=¢(x) which satisfies a 
untform Lipschitz condition converge uniformly for sufficiently small x to the 
invariant curve, which then must satisfy a uniform Lipschitz condition. 
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7. We shall now consider differentiability of the invariant curve. 


TneoreM 5. Let S be a transformation of type (1), where 0>s> 1, 
s>1 and where f and g are of class C' and vanish together with their first 
derivatives at the origin. Then the unique invariant curve tangent to the 
r-axis at the origin is of class C’ for all sufficiently small x. 


In order to make the method clear, we shall first go through the proof 
for the particular case when f(a,y)=0. Let y=y(a) be the invariant 
curve. It must then satisfy the functional equation (2). Writing this 
and subtracting, we obtain 


, 


equation for two distinct points « and 
ty(x) — + g(a, — g(x’, = (sx) — 


Applying the mean value theorem, we obtain 


(y(x) —y(2’)} —2’) 
= [t+ 90’, [— 9 + — Y(s2’)} — sz’) ], 


where both € and &* lie between x and .’. 
We have thus expressed {Y()-—v(2’)}/(«—.’) in terms of 
{y(sx) — p(sx’)} /(sa — sz’). 
Repeating the process n times, we obtain 
—¥(2’)} — 2’) 


n k 


+ s"{ (¢+ W(&-1))}-1 — W(s"2’)} — 
j=l 


where both and lie between six and 
Now v(x) satisfies a uniform Lipschitz condition (by Theorem 4) and 
gy | <¢—s—8 for all sufficiently small x and +’. Thus the second term 


in the above equation tends to zero and we obtain 


Since g, and g, are 0(1) as r—>0, this series is majorized by a series of the 


2’) + gal. Wx) + WE) 2’) 
= —y(s2’)} a or 
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form > s**(t—e)* and so is uniformly convergent. We can. therefore, let 
k 


raw and obtain 


© k 
k=1 j=a 


In the case where f is not identically zero the iteration is more difficult. 
The functional equation satisfied by the invariant curve y(z) now takes the 
form 


ty (x) + g(x. = Y(se + f(r. 
(This is the functional equation used by Poincaré [2]). Proceeding as before, 
+ glx. — g(2", = Yar) — Ya"), 
where (x,,¥(.r,)) and are the images, respectively. of (2, ¥(2x)) 
and (2’.¥(2’)). Thus 
t{y(x) — (2")} + + gl’. WE) (W(x) — — 2’) 
= — 1) [8 + W(2)) 
where €, &*. » and 7* are all between 2 and 2’. Hence, the iteration takes 


the form [ |Jo—{(s+oo)[ 8} {eof |], (é+ where 


8 eo = f(a’, W(m)). To = 
Introducing corresponding notations for the k-th iterate of this relation. we 
obtain 
; 


and the product denotes matrix multiplication. 
We now wish to establish the convergence of this iteration. We first show 
that for all sufficiently small x and 2’, the product 


can be put in the form 


A, k _&k 
( where A, =AT] aj, 0, =C IL 
C, 1+ Dz 


with |a;)<r<1. ly;|<r<1, and By. Dy = Dy. with 


Il +o; 3; 
j=0 €j t + Tj 
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| Bi | <p*, |. | <p*, and |p|<1. This is certainly true for the case k= 0. 
We shall prove that the matrix has this form for all n, by induction. Thus 
assume the proposition true for the case k =n. Then the (n + 1)-st matrix is 


B, ) s+ on 8n ) 
C, 1+ D, ( En t+ tT 


Now since this matrix product is considered as acting as a linear fractional 
transformation we may divide all four elements by the same amount without 
changing the effect of the matrix. Carrying out the matrix multiplication 
and dividing by ¢+7, we obtain the matrix 


( A,(s On) pn Andnpn B, ) 
+ On)pn + (1+ Dn)éenpn 1+ D,+ Cr8npn 

where py» =1/(t-++7,). Now by virtue of the inductive hypothesis, | B,| <B 
and | D, | < D, where B and D do not depend on n. Since on, 8n, €ny Tn are 
all uniformly 0(1) (as 2 and 2’ go to zero), we may choose x and 2’ so small 
that 


+ on + €nB/A)/(t + t)| <r <1, 
On + + Dy) /C)/(t < r < 1, 
| <p <1, | C8:/(t + | <p <1. 


Thus the (n+ 1)-st matrix has the desired form. It should be remarked at 
this point that the constants C and D are still at our disposal. 

As before, we know that the [ ], are uniformly bounded. We now choose 
C and D so small that the denominator of the n-th iterate never vanishes 
and the uniform convergence of the iteration process is established. 


Corotuary. If, in addition to the hypotheses of Theorem 5, f and g 
are assumed to be of class C*, then wy is of class C" in some neighborhood of 
the origin. 


Proof. The function y’(x) can be represented as the quotient of two 
series both of which are majorized by } Kr" with r<1. In taking successive 
difference quotients, one obtains series majorized by K > M*(n!/(n—k) !)r*, 
which is also uniformly convergent. 


8. Let T be a transformation of type (1), where f and g are of class 
C"(n=1) and vanish at the origin together with their first derivatives. 
Furthermore, let s>1>¢>0. Then both 7 and T-! satisfy the conditions 
of the corollary to Theorem 5 and hence, there exist exactly two invariant 
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curves of the transformation 7’, each tangent to one of the axes at the origin. 
We can perform a rotation of the coordinate system so that now neither 
invariant curve touches an axis. In these new (u,v) coordinates the two 
invariant curves may be represented as v—¢,(u) and v—¢.(u). We now 
write X =~v—¢,(u), Y =v—¢.(u). Since the curves ¢, and are 
perpendicular at the origin, the Jacobian of this transformation does not 
vanish. The transformation 7 is therefore defined in these new variables, 
We thus have 


THEOREM 6. Let T be a transformation of type (1) with s>1>t>0. 
Furthermore, let f and g be of class C"(n=1) and vanish at the origin 
together with their first derivatives. Then there exists a non-singular change 
of coordinates X =X(z,y), Y=Y(2,y) of class C" so that in the new 
coordinates the axes are the invariant curves and T has the form 


Y,=t(Y+ G(X,Y)), 
where F and G are both of class C", and where 


F(0,y) = G@(a,0) =F,(0,0) =F, (0,0) =G,(0,0) =G,(0,0) 


THE JOHNS HopxKINS UNIVERSITY. 
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ON A PROBLEM OF LITTLEWOOD.* 


By R. SALEM. 


In his “Research Problems” Littlewood raises the question to know 
whether it is true that, given n distinct integers m,m2,° - -,m,y, there exists 


an absolute constant A satisfying 


f | emiiz + + emniz | dz > A log n. 
0 
The purpose of this paper is to prove the following weaker result. 


THEOREM. Let mn, n=1,2,: +, be an increasing sequence of positive 


integers such that logm,—O(logn). Then, as n>, 


dx/ (log n)3 > 0. 


lim sup f |cosm - -+ cos myx 
0 


The proof is extremely indirect and is based on the refinement of an 
argument used by the author (see Salem [2]) to give a new proof of the 
theorem of Menchoff on the convergence almost everywhere of series of 
orthogonal functions: namely, if {¢;},% is any orthonormal system, and if 
> log? <2». then converges almost everywhere. 

Lemma I. Given Q(n) increasing to and such that Q(n) = o0(log? n), 
there exists an orthonormal system {¢;},°% in (0,1) and a sequence of coeffi- 
cients {a;},* such that Sa;?Q(j) <0 and that the series > diverges 
almost everywhere. 

This lemma which is equivalent to the rough statement that “ Menchoff’s 
theorem can not be improved” is known. (See Kaczmarz and Steinhaus [1].) 


Lemma IT. Given Q(n) increasing to « and such that Q(n) = o(log? n), 
there exist 

1) a fired orthonormal system {¢;}1% im (0,1), 
2) for each n, a non-increasing sequence {F;}," of characteristic func- 


tions of sets in (0,1) (depending on n), 
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3) for each n, a function QeL? in (0,1) (depending on n), with 


1 
Q*dx =1, such that 
0 


n 1 
lim sup QF dx)*/Q(n) = o. 
1 0 


Proof. Take for {¢;},*% the orthonormal system whose existence is 


asserted by Lemma J]. If it were true that 


(1) (fOr <Ca(n), 


C being a constant, no matter how the sequence {F;}," and the function @ 
were chosen for each n, it would follow, by a known argument (see Salem [2]) 


p 
that given any n numbers y,,- - -,yn and writing S,(z) = > y;¢;, one would 
1 
have 
n 
sup | S,(z) |? dx << C(S y;)?Q(n). 
0 1=p=n 1 
By the classical argument of Menchoff,t this would imply that the series 
> «;¢; converges almost everywhere whenever > aj°Q(7) <0. Hence. bv the 
particular choice of the system {¢;}, a contradiction to Lemma I. 
We shall now prove a third lemma, in which we shall adopt the tollowing 
notations : 
{0,(t)},* is any orthonormal, uniformly bounded system in (0,1), say 
| SM. 


is any sequence of coefficients, and s(t) 
1 
{m;},° is any increasing sequence of positive integers. 
w(t) is of the class Z in (0,1), and y(t) 21 (yw may depend on n), 
{6;(x)},* is any orthonormal system in (0,1). 


{F;(x)}," is any non-increasing sequence of characteristic functions of 


sets in (0,1) (which may depend on n). 


? We mean the argument of Menchoff’s theorem used to prove that for all ortho- 


1 n 
normal systems sup 7;*) log? and deducing from this 
1 


0 
inequality that converges almost everywhere whenever 7;? log? j< ~. 

In order to apply this argument (see, e.g., Kaczmarz and Steinhaus [1]) it is 
neecssary to suppose that 2(n)/log exceeds a constant. But this is irrelevant for 
the proof of Lemma II, since clearly we can replace, in (1), Q(n) by a function 
increasing more rapidly. 


W 
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For the sake of simplicity, everything is real. 


Lemna III. 


n vl 1 
=(f QF de) *em,? = f dt) - max Sm, (t) dt. 
j=l 0 J 09 0 


Proof. 


al al 1 
(2) a— f f Om;2(t)y(t) dts y(t) dt, 
e 9 0 
and consider the function 
P (a, t) — (2) 


where the {f;,(x)}.”» is a non-increasing sequence of m, characteristic func- 
tions of sets in (0,1) which will be determined in a moment. 


1 a 
One has, if P(x, en, f Ofm while 
0 0 


the left-hand side vanishes if 7 > n. 


(3) 


Now 


P( x,t) = fale) = 
where Af, = fx(X) Afim, 


fx(z), we have P?(z, t) = (t) and 


J P2(x, t)dedt = 
0 0 


Take now 


f,=F; for km; 


and write AF; = F;— Fy, AF, =F). 
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Q(x) is any function of the class L* in (0,1) (which may depend on n), 
1 
with Q?(2)dz—1. 
0 


We have, with the preceding notations, 


The system of functions p;(z,t) =4;(%)6m,(t)y(t)4 is ortho- 
gonal in the square OS¢=1, OS 751. 


Hence Bessel’s inequality gives 


n 1 Dp 1 
Om f f (a, t)dedt. 
j=l 0 0 0 


Remembering the definition of 


(m=O), 


| 
d 
= 
1 
1 
‘ 
n 
1SjSn 
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1 

since f Q*dx=1. Hence, by (3) and (2), 
0 


( ( OF M2( ydt) max f "Sm,2(t) /(t) dl, 
0 0 j= 0 


1SjSn 


which proves the Lemma. 


Proof of the theorem. The notations being the same as in the Lemma, 
take 6,(¢) (kt—«,), so that the series >) c;,6;,(t) becomes a trigono- 
metric series (the fact that the interval of orthogonality is (0,2) and that 
the system is not normalized is obviously irrelevant). We define now: 


y(t) Sm,(t)| +1, 


and we suppose that |c»,|=8>0. The lemma then gives, A being an 


absolute constant: 


J, 4/8 (f max f sm, (4) | dt. 


Now it is easily seen, since s,, is a partial sum of the polynomial s,»,, that 


(4) Jf, SC log m, Sm,(t) | dt. 


Hence 

n 1 21 

QF = AC/8? log m,- max { | Sm, (¢) | 

j=1 0 1SjSn 0 
Now suppose that logm,—O(logn). Then the second member of the last 
inequality is 


O(log n) - max { “| Sm, (t)| dt}?. 
0 


1=jSn 
If we had lim “| 8m,(t)| dt/(log j)#=0, the inequality would give 
0 
n 
idx)? =o (log? n), 
0 


the second member being independent of {¢;}:°%, {Fj}:" and Q. This is 


impossible, by Lemma ITI. Hence, if in the series }\ c, cos (kt — a), whose 
1 
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m-th partial sum we denote by sm, we have | Cm, | 28> 0, log m;= O(log 7), 


then 


(5) lim sup J Tm, (t) | dt/ (log j)4 > 0. 


This result includes obviously our theorem, which is thus proved. 


Remark. Let us observe that if > c, cos (kt —a;) is a Fourier-Stieltjes 
series, one gets a stronger result than (5): for then (4) can be replaced 


by f, vata = Blog m,, B being a constant. We have then with the same 
9 


hypothesis 


n a 
=O (log n) max Sm, (t) | dt, 
0 


which leads, instead of (5) to 
(6) lim sup fi $m, (t)| dt/log j > 0. 
70 


a 
This is, in particular, the case, if lim int f | sm(t)|dt—O(1), since this 
0 


relation implies that } c, cos (kt—a,) is a Fourier-Stieltjes series. 


Other consequences of Lemma III. Lemma III can be used to prove 
other results. We shall give one example, which generalizes a result obtained 
earlier (see Salem [2]). 

By 6;(¢) we denote now generally, as in the Lemma, any orthonormal, 
uniformly bounded system in (0,1), with |6,(¢)| <M. Writing as before 


8m = we shall prove that if | Cm, |logj—>0o then sup | sm,(¢)| cannot 
1 j 


belong to L. 
In fact, suppose sup | 8»,(¢)| eZ, and choose, in the inequality of Lemma 
| 


IIT, y(t) =sup | sm,(t)| +1. The inequality becomes 
j 


Now, if |¢m,|logj7—>o, we can find a function w(j)—>0o such that 
o(j)/log j decreases and that | cm, |log7 >o(j). Hence 


(o(n) log n)? 3 ( f 


m 
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which again contradicts Lemma II, by a suitable choice of {¢;},:%, {Fj}, 


and Q. 
Hence sup | implies lim inf | cm, | log = O(1). 
j 


One deduces immediately (by considering a subsequence of {m,;}) that 
sup | Sm,(t)|eZ implies cn, — 0. 

In particular, sup | s,,(¢)|¢Z implies ¢,—>0 for any orthonormal, uni- 
formly bounded system. Direct proofs can be given of this result, a remark 


which I owe to A. Zygmund. 


MASSACHUSETTS INSTITUTE OF TECHNOLOGY. 
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SOME LINEAR MINIMAX PROBLEMS OVER AN ORDERED 
FIELD.* 


By R. J. Levit. 


1. Introduction. Let S be a subset of an n-dimensional vector space 
over an ordered field F fixed throughout the paper. If, as the vector 
X=(X,, -:,Xn) ranges over S, max|X;| has a minimum value M*, then 

j 


M* is called the minimax of S. Clearly, when M* exists, it may be charac- 
terized as the unique element of F' satisfying the two conditions, 


For every XYeS max | X;| = M*. 
(1.1) 


There is a vector X*eS such that max | X*;|—M*. 


A vector X* satisfying the second of these conditions is called a minimax 
vector of S. It may be regarded as least among the vectors of S in the 
sense that each component is at least as small in absolute value as some 
component of every other vector in 8S. 

Given a consistent system of linear equations with coefficients in F, 


n 
(1.2) = Vio 


j=1 
let G be a set of integers g:,--+,9s, 1SSg,ySn, and denote by S®% the 
totality of vectors (Xy,,° y,) such that (Xi,- - -,Xn) is a solution 
of (1.2). The minimax M®@ of S@ we call the minimaz of the system with 


respect to &y,,°- *,% , Thus, M@ is the minimum value of max|X,| as X 
jeG 


ranges over all solutions of (1.2). A minimax vector X% of S@ we call a 
minimax solution with respect to 2% Our main result is the deter- 
mination of M® and the characterization of the minimax solutions with 
respect to 2y,,° - *,%y, for the system (1.2). If certain submatrices of the 
matrix of (1.2) are non-singular, there is a unique minimax solution X%, 
and an explicit formula is given for it. These results are obtained in Section 4 
utilizing the already known case of a single equation [1], which is discussed 
in Section 2, and the notion of a Pliicker array, which is defined and 
developed in Section 3. 


* Received July 7, 1953; revised December 15, 1954. 
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In Section 5 the results of Section 4 are applied to determine the minimax 
M* of the range of a vector function L(2,- : -,%_) having as components 


the linear functions,* 


q 

(1.3) = dix; — bin (t=1,---,n), 
j=1 

as 2,° * *,%q range independently over F. In this case M* is called thy 


minimaz of the system of linear functions. The vectors X* = (4*,,-- +, X*,) 
such that max | L;(X*)| = M*, called the minimaz points of the system, are 


characterized also. 

In Section 6 the results of Section 4 and Section 5 are applied to some 
problems of best approximation in the Tchebycheff sense; namely determina- 
tion of best approximate solutions to inconsistent systems of linear equations’ 
and best approximation of arbitrary function on finite sets. The latter 
generalizes some results of De La Vallée Poussin [5]. 


2. The minimax solutions of a single equation. We consider the 
equation, 


(2.1) AjLj = Uy (ajeF). 


‘Mes 


1 


Lemma 1.1. If X ts a solution of (2.1) and a;0 for at least one 
integer j, 1S then max | where 


(2.2) =Ao/ > | a; |. 
j=1 


Proof. The result follows immediately from 


Xx; 


n n 
(2.3) | ao | ay! = (max | X;|) a; |. 
=1 j=1 
THEOREM 1. Denote the set {1,---,n} by N. If in (2.1) a; for 
at least one je N, the solutions have the minimax MN =|, |. Let D be the 
set of integers je N for which a;=0. Then XN is a minimaz solution if 
and only if 


(2. 4) XN; |S|ul(jeD); XN;=ypsgna; (je N—D). 


1 This problem has been solved in the field of real numbers by M. Krein [3] using 
functional analysis and in the field of complex numbers by V. K. Ivanov [2]. 

*This problem has recently been treated in the field of complex numbers by E. 
Remez [4]. 
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Proof. Consider first the case By the lemma 
satisfies the first minimax condition (1.1). For a solution XY of (2.1) 
equality holds between the first and second members of (2.3) if and only 
if the non-vanishing terms in the sum >\a;X; all have the same sign (which 
is then necessarily the sign of a)) and between the second and third members 


if and only if the X; all have the same absolute value. Hence, max | X; |= |p! 
if and only if sgn ya; and | X;|—| for 7—1,- - -,n. 


Since direct substitution with the aid of (2.2) verifies that 7;—=psgn a; is a 
solution of (2.1), || also satisfies the second minimax condition (1.1) with 
the second expression (2.4) as the minimax solution, which is unique in this 
ease. The proof for D empty is then completed by noting that, when a, — 0, 
the unique minimax solution is XY;—0, j—=1,---,n. Finally, when D is 
not empty, we apply the foregoing result to the system 3) ajaj—d). Then, 


since > |a;|—wp, the theorem is proved. 
jeN-D 


3. Sign-compatibility of Pliicker arrays. Any skew-symmetric square 
array over F' of rank two or less we call a Pliicker array (P.a.). The existence 
proof for minimax solutions of (1.1) depends on the property of these arrays 
stated in Theorem 2 below, which involves the following notion. Two vectors, 


(3.1) C, = (¢n,° C, = (C21, ° s 


over F are called sign-compatible (s.c.) if sgn cyjCojCixC2,A—1 for 
j,k=1,---,s. This relation is evidently symmetric and reflexive but is 
not transitive; e. g., of the vectors (1,1,1), (1,0,1), (1,—1,1) the first and 
second are s.c. as are the second and third but not the first and third. 


Lemma 2.1. Let the vectors (3.1) contain a pair of corresponding 
components Cyq and Coq such that Cieloa 0. Then C, and C, are s.c. if and 


only if 
(3. 2) SON C1jCoj SYD 
whenever 0. 
Proof. If C, and C, are s.c. and ¢1;C2; 40, Sgn 61jC2jCiaCog = 1, and (3.2) 


follows. Conversely, assume (3.2) holds whenever ¢,;¢.;540. Then for 
arbitrary j,k =1,- -,s either or 


SQN C1 jCoj = SON Crqloqg = SQN AO 


so that sgn ¢,;C2;C:,C2,—=1. Hence, C, and C, are s.c. 


R. J. LEVIT. 


Lemma 2.2. If vectors Cy, Cy, C, are rows of a P.a. C= (cy), then 


(3.3) + CrpCg + = 0. 
Proof. If 
(3. 4) Cpq = Car = Crp = 0, 


(3.3) holds trivially; hence, we assume (3.4) does not hold. Since the 
rank of C is not more than two, there are elements k,, k2, k;e F not all zero 
such that 

(3.5) key Cpj + + = 0, (j=1,-- -,8) 
where s is the order of C. For definiteness we assume k; 40. Then, writing 
(3.5) in the cases jp and j=—q and applying the skew-symmetry of C, 


we obtain respectively 


koCpq ksCrp 0, — 0. 


These imply that c¢,g+40, since otherwise (3.4) would hold. Hence, 
= = Substituting these values into (3.5), we 
obtain (3.3). 


LemMA 2.3. Two rows C, and Cy of a P.a. C= (cy) are proportional 
if and only if =0. 


Proof. Let 7 =1,: -,s, where s is the order of C and ke F. 
Then in particular, Cpg—=kcgg—=0, since C is skew-symmetric. Conversely, 
let Cpg—=0. If cp; for 7—1,- - -,s, Cp and C, are proportional. Other- 
wise, there is an element Cy,540; SO Cryp==—Cp~0. By Lemma 2.2 
CarC'p + CrpCg=0. Hence, C, and C, are proportional. 

A P.a. is said to be reducible if two of its rows are proportional. By 
the lemma just proved a P.a. is irreducible (not reducible) if and only if 
all its zero elements lie in the main diagonal. Let C= (cj) be a reducible 
P.a. of order s with row C, proportional to C4. 


(3.6) Coy = 


The sub-array C’ obtained from C by deleting the row and column of index } 
is called a direct reduction of C. An (s—n)-rowed minor array C™ of C is 
called a reduction of C if there is a sequence C’, C”,- --,C“-» of reducible sub- 
arrays of C each of which is a direct reduction of its immediate predecessor 
and such that C’ is a direct reduction of C and C™ of C@-, It is convenient 
to regard C) —C as itself a reduction of C. Clearly, a reduction of a P.a. 
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is again a P.a. We use the symbol C, to denote the row (Cijs° °°» Cijsn) 
of CM, 


Lemma 2.4. Two rows OM, and C™, in a reduction C™ of a reducible 
P.a. C= (ci;) are proportional if and only if Ca and C, are proportional 
and are s.c. if and only tf Ca and C, are s.c. 


Proof. It is sufficient to prove the lemma for n—1; i.e., for a direct 
reduction OC’ of C. Let C’g and C’, be proportional. Then by Lemma 2.3 
Cae = 0, and hence, Cg and C;, are proportional. Next, let and C’, be s.c., 
and suppose that C’ has been obtained from C by deleting the row and column 
of index b so that (3.6) holds for some integer a. Then, 


(3. 7) sen CajlejlamCem 
holds for every j,m=—1,- - -,b—1,b+1,---,s. But 


SQN CajCejCanCen == SQN CajCejCyaCne SQN = SQN — 1 


by (3.6), (3.7). That is, (3.7) holds for mb also so that Cg and C, 
are s.c. The converses are trivial. 


Lemma 2.5. If a P.a. C=(cy) contains a set of r rows no two of 
which are proportional, so does any reduction of C. 


Proof. It is sufficient to prove the lemma for a direct reduction C’. 
Let C’ be obtained from C by deleting the row and column of index 6b so that 
(3.6) holds for some a. Let I be the set of indices of the r rows in question. 
If b¢T, the conclusion is an immediate consequence of the preceding lemma. 
Otherwise, a¢ J and the set {C’;} for ic I— {b} + {a} is composed of r rows 
of C’. By Lemma 2.4 no two of these rows are proportional except possibly 
C’, and some row 0’, with de I—{b}. But, since C, and Cg are not pro- 
portional, ¢y,4540 by Lemma 2.3. Hence, ¢aa40 by (3.6), and accordingly 
by Lemma 2.3 C’, and O’g are not proportional either. 


LemMMA 2.6. Let (cj) be an irreducible P.a. of order s=4; define 
the vectors Vin = (Cix, * Cin), 8, and let d, e, m be integers, 
dAe1<cdsim<s,1<exm. Then if Vim is s.c. with both Vam and 
Vem, 8.c. with at least two of the vectors, Vamsi, Vms1mer 


Proof. Tf Vas and Vem are both s.c. with Vim, there is nothing 
to prove. Otherwise, we assume without loss of generality that Vams is 
not s.c. with Vim. Then there is an integer a, 1< am, such that 


SQN = —1, OF SQN CiaCaa = — SYD HO. But, by 
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Lemma 2.1 for every integer j ~d such that 1 <j SM, sgn ¢,;C4j == SQN C:qCaq, 
since for by Lemma 2.3. Hence, 


(3.8) SQN CjCaj = — SED C1, O 
and in particular sgn = — AO, SO that because of the 
skew-symmetry 

(3. 9) SQN CaelC1,m+1 SQN CerCa,m+1 0. 


Similarly, if were not s.c. with Vijms1, SM Ceali,m+1 = SYN 0, 
OF SQN Cacli,m+1 = AO. This combined with (3.9) would imply 
that CacC1,ms1 + CerCa,mer + C14Ce,m+1 contrary to Lemma 2.2. Hence, 
is s.c. with Since by Lemma 2.2 Cms1,1€ja + C1jCmit.a + = 9 
and, for jd such that 1 << jm, sgn C1jCms1,4 = Cms1,1Cja AO by (3.8), 
it follows that 


SQN Cj,ms1Cid = — SYD FO, OF SYN C1jCms1,j = SBD 


for 1 << jm; i.e., whenever ¢1;Cm41,; 40 by Lemma 2.3. Hence, by Lemma 
2.1 8.c. with Vi,ms1; and the lemma is proved. 


THEOREM 2. If a P.a. C=(cyj) contains a set of three rows no two 
of which are proportional, then any given row Cy containing a non-zero 
element is s.c. with precisely two rows Cy and C, not proportional to Cq or 
to each other. Cy and C, are not s.c. with each other provided that C 
contains a row not proportional to any of the rows, Ca, Cy, Cy. 


Proof. We may suppose without loss of generality that C, is C1, since 
a P.a. remains a P.a. when the rows are renumbered in any order provided 
that the columns are also renumbered in that same order, and since sign- 
compatibility and proportionality of rows is independent of their numbering. 
First we establish the existence of the rows Cy and C, when C is irreducible. 
In the notation of Lemma 2.6 and V3, are s.c. with since cy = 0. 
When the order of C is s—3, this means that C, and C; are s.c. with C;. 
When s = 4, it implies that at least two of the vectors, Vo, Vss, Vas, are 8.0. 
with V,, by Lemma 2.6 with m3. Similarly, applying Lemma 2.6 with 
m =4,5,- - -,s—1 successively, we find distinct integers f and g,1<fSs, 
1<gss, such that V;, and V,, are s.c. with V,,; i.e., Cy and Cz are s.¢. 
with C,. Now suppose C is reducible, containing n rows proportional to (C;, 
(besides C, itself). The minor array obtained by deleting these n rows and 
the corresponding columns is by Lemma 2.4 a reduction C™ of C, in which 
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(™), is proportional to no other row. By Lemma 2.5 C™ has an irreducible 
reduction OC, r =n, containing at least three rows, one of which is C™. 
As already shown for irreducible arrays, C” contains two rows, C™, and C™,, 
s.c. with C,. Hence, by Lemma 2.4, C; and C, are s.c. with Ci, and no two 
of the three are proportional. Next, if there is a row C, not proportional 
to any of the rows, Co, C;, Cy, then by Lemmas 2. 3, 2.1 sgn CigCyg = Sgn CinCyn, 
SEN C7Cof SYD CinCgn, ANd hence, sgn SEN CinCyg — SYN There- 
fore, SQN = =—1. Thus, C, cannot be s.c. with 
C,, and neither can C; be s.c. with C5. 


4, The minimax solutions of the general system. If the system (1. 2) 
is consistent and has rank r > 0, we can number the equations so that the 
subsystem, 


n 
(4.1) = Mo - -,#), 
j=1 


also has rank r. It is then clear that, with respect to any given subset of the 
unknowns, every minimax solution of (1.2) is a minimax solution of (4.1) 
and conversely. We, therefore, confine our attention to the system (4.1) 
with rank equal to the number of equations. Denote the matrix of (4.1) 
by A; i.e, A= (ay) is an r Xn matrix of rank r. Denote the augmented 
matrix . A, the r Xs matrix composed of the columns of A with indices 
jis’ * *>Jg in that order by A;,..;,, and the determinant of the square matrix 
A;,.;, by @,...;,. It is sometimes convenient to let J denote the ordered set 
1 and write for Aj,...;, and similarly for 
the «’s when sr. If J is the null set, we understand Aj,,...;, to be the same 
as A;,;,. The determinants a;,..;, play an important role in the minimax 
problem because they are essentially Pliicker coordinates for the (n—r)- 
dimensional subspace of intersection of the r hyperplanes (4.1) in projective 
n-space over F’, and they consequently characterize the totality of solutions 
independently of the particular system of equations through which it is defined. 
However, we shall derive the required properties of the a’s directly without 
recourse to this geometrical interpretation. Evidently an interchange of sub- 
scripts merely changes the sign of a;,..;,; and, if two or more subscripts are 
alike, If r>1 and J=(j,,: -,j,1), we denote the (r—1)- 
rowed square submatrix obtained by deleting the row of index 7 from Ay by 
A‘; and its determinant by a‘;. We then have 


(4.4) (—1) tat jay = ay; (7=0,- -,m). 


4=1 


548 


Lemma 3.1. Jf (X1,--+,Xn) is a solution of the system (4.1) and 


(4. 5) X = 
j=1 


Proof. By (4.4) 


n n Tr n 
j=1 =2 j= 


j=l i=1 4 


= >) (— 1) aig = a0. 
i=1 


By (4.4) if J=(j1,° the last row of the matrix 


| 
Ark, Ark, 1 | 
J 


is a linear combination of the first r rows, and hence, the determinant vanishes. 
Expanding by elements of the last row, we obtain the bilinear relations. 


LeMMA 3.2. Let K=(hk,,- be a sequence of r—2 distinct 
integers, Then the array (agi) for 14,7 —=1,- - -,n 1s a Plicker 
array. 
Proof. Kvidently, exi;—— Moreover, setting 7,—k, for h=1, 


2, 4, Bis d, — é, om f in (4. 6), we obtain 
(4. 7) + + 0. 


Thus, three arbitrarily chosen rows, (¢xin,---,%xin) for i—d,e, f, are linearly 
dependent. 
The following familiar result is stated here for reference. 


Lemma 3.3. If a matrix B has a non-singular r <r submatrix C and 
all (r+1) X (r+1) snbmatrices of B that contain C are singular, the 
rank of B ws r. 


Let J = (j1,---,Jr-+) be a sequence of r—1 distinct integers, 1 Sj, =n. 
The number w(J) of zero determinants in the sequence (aJ;), 7—=1,° 
we call the order of J with respect to A. Evidently, w(J) =r—1. If 
w(J) <n, we call J a proper column index sequence (p.c.i.s.) for A. Since 
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A is of rank r > 0, there is a determinant «;,.;,540. Hence, there is always 
a p.c.i.s. for A. (When r—1, the (only) p.c.i.s. for A is the null set.) 


Lemma 3.4. J=(ji,° *,jra) is a p.c.t.s. for A tf and only if Ay 
has maximum rank r—1. 


Proof. Let Ay; have rank r—1 so that, for some integer g, A%; is non- 
singular. Let A’ be a matrix obtained by permuting the columns of A so 
that A, occupies the first r—1 columns. The only r Xr submatrices of A’ 
that contain A’, are the matrices Ay; with j7#J. They cannot all be singular, 
since by Lemma 3.3 the rank of A’ would then be r—1 instead of r. Hence, 
at least one of the determinants a,; does not vanish; so J is a p.c.i.s. for A’ 
and hence, for A. Conversely, if w(J) <n, there is an integer f, 1S fn, 
such that ay>40 and hence, by (4.4) not all the minors at; are zero. 
Therefore, the rank of A, is r—1. 


Lemma 3.5. Let H be a sequence of q<r wmtegers hi,- - +,hg, 
1Sh,Sn. If the matrix Aq has maximum rank q, then there 1s a p.c.1.s. 
for A containing H. 


Proof. Let Ay have rank gq. If g=r—1, H is itself a p.c.i.s. by 
Lemma 3.4. Next, let g=r—1—s with lSs<r—1l1. Ag contains a 
non-singular g q submatrix Y. If every (¢+1) X (q¢q+1) submatrix of 
A containing Q is singular, then by Lemma 3.3 r=—gq contrary to hypothesis. 
Hence, A has a non-singular (¢-+1) X (¢+1) submatrix Q, containing Q. 
If s=1, the order of Q, is r—1; and by Lemma 3.4 the indices of 
the columns of A occurring in Q, constitute a p.c.i.s. containing H. If 
s>1, we repeat the process obtaining a (¢+2) X (¢+2) submatrix Q, 
containing (, and therefore Q. After s such steps we obtain a non-singular 
(q+s) X (¢q+s) =(r—1) X (r—1) submatrix Q, containing Q, and the 
indices of the columns of A occurring in Q, constitute the desired p.c.i.s. 
containing H. 


LemMaA 3.6. Let the matrix A be such that the sequence L = (1,2, 
‘+,;r—1) wa p.c.t.s. of order w and 


and let J=(jy,- be sequence of integers, 1Sj,Sw. Then, 
(4.9) = (k=1,:--,n). 


where 6; is an element of F independent of k; namely, 


(4.10) 67 = 


10 
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Proof. By Lemma 3.4 there is an integer g such that A%z, is non-singular, 
All the r Xr submatrices of A,..~ containing A%z, are singular by (4.8), 
Hence, by Lemma 3.3 k=1,- - -,w;so that, setting k, h =1, 

kp =n, =k, (4.6) becomes + 0. Since 
L is a p.c.i.8., m0 by (4.8) and the lemma follows. 


Lemma 3.7%. Let A be as in the preceding lemma with the further 
property that a, and let K=(ki,: -,kp2) be a sequence of distinct 
integers, Then, for each 7=1,:--,n 
(4.11) OK = KK (k=1,-- -,w), 
where $; is an element of F independent of k; namely, 

(4. 12) oj = — 


Proof. By (4.4) and (4.8) we have 


Aik, Uz Ark 


Lemma 3.8. With the hypothesis and notation of Lemma 3.7%, if K isa 
p.c.t.8. for there is an integer e, 1 such that and 


for j—w+l,- --,n. 


Proof. Since the order of K with respect to A’,...~ is less than w, there 
is an integer e, 1 ew, such that a’x,.540. Hence, Ag, has maximal rank 
r—1; so by Lemma 3.4 (k,,- - -,k,2,e) is a p.c.i.s. for A. Thus, there 
is an integer g, 1 such that axe, 0. By Lemma 3.6 = 
Hence, 6-540, and therefore ¢xe; = ~0 for —=w-+1,---,n by (4.8). 


Lemma 3.9. Under the hypothesis of Lemma 3.8 there are integers f 
and gw<fSn, w<gsn, such that all the non-zero members in the set 
{Oxoj%rj}, 7 have the sign of (defined by (4.12)) and 
all those in the set {agyjar;} have the sign of — ¢y. 


Proof. If oxi;—0 for i,j7—=w-+1,- - -,n, the theorem is trivial. Hence 
we assume there are integers 6 and c, wxbn, wen, such that 
By Lemma 3.8 there is an integer e, 1 = w, such that 
for Then the array (agi) (i,7—=1,: which by 
Lenima 3.2 is a P.a., contains three rows no two of which are proportional; 
namely, R, = (¢xe1,° by Lemma 
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9.3. Accordingly, by Theorem 2 there are two rows Ry= (axp,° * *, %xpn) 
and R,—(@xo;° * *,%xgn) 8.c. With R, and not proportional to R, or to 
each other. By Lemma 2.3 


(4. 13) 90; 


and, since %xex—0 for k=1,---,w by Lemma 3.7 and eq. (4.8), f>w 
and g>w. Then for any integer j, w< such that ag, 0, 
SON by Lemma 2.1. But by Lemma 3.6 axe; = 
and %xey =9xe%r,; and therefore, since Oxe40 by Lemma 3.8, 


(4.14) SON —= SEN 

Likewise for any integer 7, w< 77, such that ag,;0, 

(4.15) SEN OK —= SON 

By Lemma 3.6 and eq. (4.12), 

(4.16) sgn = sgn = — SEN = — Sgn 


By (4.13) one of the expressions, @ref%reg&%xg ANd Aref%reg&%xgf, 18 positive 
and the other negative. Hence, we may assign the subscripts f and g so that 


SON = 1. 


Therefore, by (4.16), —sgn = 1, or SEN — Sgn dy So that 
by (4.14), 

(4.17) SQN —= — SQN dy 

for all non-zero members of the set {ax,j%z;}, 7—=w+1,---,n. Again by 


(4.16) —sgn =1, OF = SEN, SO that by (4.15), 
(4. 18) SQN SEN dy 


for all non-zero members of the set {ax j%z;}. (4.17) and (4.18) establish 
the lemma. 

Let J = (j:,° + *,jr+) be a p.c.i.s. of order w with respect to A, and 
denote by j,,jrs1,° °°, jn the integers 1,- - -,n not in J in an order such that 


Hence, the system (4.1) (of rank r) has solutions with arbitrary values 


assigned to 2;,, 2; -+,a;,,- In particular, if we define 


3 
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n 

(>| a;| 40 since J is a p.c.i.s. for A), there are elements X;j,,- - -, Xj, 
and X;, in F such that for for 


Jw jn is a solution of (4.1). Then by Lemma 3.1 and egs. (4. 20), 
(4.19), 
w n-1 n-1 
p=1 1 p= 


v=w+ 


Hence, X;, = p/sgn %J;, = We conclude that the system, 


w n 

(4. 21) Dd 4ij,Lj, = dio — Aijy VJiy (4=—=1,- 
p=1 v=wr+l 

in has the solution (X;,---,4;,). The matrix A;,..;, of this 


system contains a non-singular (r—1)-rowed submatrix A%; by Lemma 3.4; 
and by (4.19) all the r X r submatrices of A;,...;, containing A%; are singular. 
Hence, by Lemma 3.3 the rank of Ohidy is r—1. The set of equations 
(4.21) is called the derived system of (4.1) with respect to J. We have just 
proved 


Lemma 3.10. The derived system of (4.1) with respect to a p.c.1.s. 
for A is consistent and has rank r—1. 


We now define 


| Aik, 10 
j=wtl 
r—1,0 | 


w 
, , ; 
k=1 
where K and L have the meanings assigned in Lemmas 3.6 and 3.8. 


LeMMA 3.11. Under the hypothesis of Lemma 38.8, there is an integer h, 
w<hsn, such that either | or 2 | |. 


Proof. By Lemma 3.10 the derived system of (4.1) with respect to the 
p.c.i.s. L=(1,- +-,r—1) has a solution (X,,- - -,Xw), so that with the 
notation (4. 22) 


(4. 23) Dd = (1=—1,- 
k=1 


and, assuming a suitable numbering of equations (4.1), the first r—1 equa- 


w 
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tions of the derived system form a system of rank r—1. Applying Lemma 
3.1 to this latter system, 

k=1 


Then by Lemma 3.7, eq. (4.24), (4.22), and Lemma 3.7 respectively, 
(4.25) = $52 = Ko = | | 
k=1 


= (sgn gj) | | 


But by (4.23), (4.22), (X1,° +, Yim) is a solution of (4.1), 
and by Lemma 3.8 Kj = isa p.c.i.s. for A when w jn. 
Accordingly, 


(4. 26) | AKjk | = = = OK + 2 ORjkYLk 
| 


= (sgn = | | + 2 Lk 
=wt 
for j=w+1,---,n by (4.20), Lemma 3.1, (4.25) respectively. Let f 
and g be the integers whose existence was established in Lemma 3.9; define 
h=f if prw’n <0 and h=g otherwise, so that all non-zero terms in the 
last member of (4.26) have the same sign for jh. Then we obtain from 
(4. 26) 


| | >| | =| | S| | +] | oxnx |; 
k=1 k=1 k=w+1 
and hence 
k=1 k=w+l 


from which the lemma follows at once since > | ¢xx,| 0 by Lemma 3. 8. 
We now define the residual submatrix of a system of linear equations 
(1.2) with respect to %»,° * +, , as the matrix obtained by deleting the 
columns of index g:,- - -,gs from the matrix of the system, and proceed to 
state the principal theorem. 


THEOREM 3. Denote the set {1,---,n} by N; let ge} be 
a set of s integers of N, and write H=N—G. Let the matrix A of the 
system (4.1) have rank r equal to the number of equations. Also, if s<n, 
let Ay, the residual submatrix of (4.1) with respect to 2° * -,%p,, have 
maximum rank. Then the system has the minimax M¢ with respect to these 
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unknowns given by the following: (a) For 1Ssiin—r, M°=0 and the 
minimax solutions with respect to are the vectors such that 


(4.27) X%—0 (jeG); (i—1,-- 
je 
(b) For n—r<sSn, =max | p;|, where is defined by (4.20) and J 
J 


ranges over the set Tq consisting* of all the p.c.t.s. for A that contain H. 
Moreover, let E be a p.c.t.s. in Ty such that | wz |—=max|pz| and let D be 
the set of integers such that 


(4. 28) = 0 (jeD); 0 (geN—D). 

Then the minimax solutions with respect to are the vectors 

such that 

(4. 29) Xe; = yn; (ge N —D) 

where yx; 1s defined in (4.20) and the remaining components satisfy the 

equations, 

jeD jeN-D 

and the inequalities, 

(4.31) (jeD—H). 


Proof. First lett 1S=s=n—r. Then, since the matrix Ay of the 
coefficients of the unknowns 2; with je H has the same rank r as A and A, 
(4.1) has solutions with arbitrary values assigned to the remaining unknowns 
In particular, there is a solution X@ satisfying the conditions 
(4.27), and the theorem follows at once for this case from the minimax 
criterion (1.1). For n—r<ssn the proof is by induction on r. For 
For r=1 (in which case sn) the present theorem reduces to Theorem 1. 
Next let r—=R>1, n—R< sn, and assume that the theorem holds for 
r= R—1, n—(R—1)<sSn. For any solution X of (4.1) 

= 
jeN-E 
by Lemma 3.1, since az; =0 for7eH#. Applying Lemma 1.1 to this equation 
and noting that G—=N—H  N —E, we find that 


max | X;|= max | X;|=| 
jeG jeN-E 


‘T, is non-empty by Lemma 3.5. If s=n, H is empty and I, is understood to 
consist of all p.c.i.s. for A. 


[ 
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Then by the minimax criterion (1.1), if there is a solution X¢% of (4.1) 
such that 
(4. 32) max | =| pa |, 

jeG 


then M¢—|pz|, and X¢ is a minimax solution of (4.1) with respect to 
Moreover, X¢ satisfies the conditions (4. 29)-(4.31). To verify 
this last statement note that by Lemma 3.1, (4.28), 2; =X; is a solution 


of the equation, >! ap;—g0; and at the same time max | X%|<| ya] 
jeN-D jeN-D 


by (4.32), since G2N—D. Therefore by Theorem 1 and (4.28), it is a 
minimax solution of this equation and has components X°; for je N—D 
given by (4.29). Then, since X® satisfies (4.1), the remaining components 
satisfy (4.30); while (4.31) follows from (4.32). Conversely, since 
| yz;|—= any vector satisfying (4. 29)-(4.31) is a solution of (4.1) 
for which (4.32) holds. It remains to establish the existence of a vector X¢ 
such that the components XY; for je D satisfy (4.30) and (4.31). For this 
purpose it is convenient to assume the unknowns numbered so that the sub- 


scripts in H are 1,2,- - -,n—s (unless sn, in which case H is empty) ; 
the subscripts in H—H are n—s-+1,n—s+2,---,R—1 (unless 
s=n—R-+-1, in which case H —H is empty) ; and the subscripts in D— EF 
(if any) are R,R-+1,---+,w, where w is the order of H with respect to A. 
Then H—=L=(1,---,R—1), and the equations (4.21) for the derived 
system of (4.1) with respect to H become 


w 


(4. 33) = Ain (t=m1,---,R). 


j=l j=w+l 


By Lemma 3.10, assuming a suitable numbering of the equations (4.1), the 
firss R—1 equations (4.33) form a system S’ of rank R—1. For s<n, 
if dy has maximum rank, then the residual matrix A? of 8’ with respect 
tO also has maximum rank, since the R-th row of Ag 
is linearly dependent on the other rows. Hence, if s >n—(R—1), then 
by the induction hypothesis S’ has with respect to these unknowns the minimax, 
—max|p’x| as K = (k,,: - -, kr.) ranges over all p.c.i.s. for A*®,. 


containing (1,---,n—s). With the specified numbering of the unknowns 
(4.28) reduces to (4.8), and all the hypotheses of Lemmas 3. 6-3. 8 and hence, 
also 3.11 apply to A and each set K with r—R. If for any such K, 
‘vx | > |x|, then by Lemma 3.11 there would be an integer h, w<hSn, 
such that |ux,| > |pz|. But, inasmuch as (Kh) is a p.c.i.s. for A by 


Lemma 3.8 and contains H since K does, we have | |S max|py| =| pa | 
Jel, 


| 

| n 

8 

x 

to 
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=|yz|. Hence, for s>n—(R—1). This holds 
K 


for s =n — (R—1) also, since in that case 8’ has the minimax M’ — 0 with 
respect tO Zn-si1,° * *»%w, a8 we find by applying the present theorem for 
s = n—r, in which case it has already been proved. Now, if (X°%,---,X@y) 
is any solution of S’, then by Lemma 3.10 it satisfies all the equations (4. 33) 
and hence, also with the specified numbering the conditions (4.30). If in 
addition it is a minimax solution of S’ with respect to Tn-s:1,° * *,%w, then 
|X¢;|=M’S|pz| for j—=n—s-+1,---,w, and with the specified num- 
bering this is equivalent to (4.31). That is, we obtain the required vector 
X¢ by choosing (X%,- - -,X%,) as a minimax solution of S’ with respect 
tO Thus, the theorem is proved for r—=R, n—R<sSn, 
and the induction is complete. 

We remark that in determining max | pys| it is sufficient to consider 


sequences J = *,fr+) with jr+, since the value of | 
is independent of the order of the indices in J. 


3. For n—r<ssin, if D=H= if 
the determinants ap; vanish only for je H, there is a single minimaz solution 
the components X%; for je being uniquely determined by the equations 
(4.30) alone. In the notation of (4.22) with E in place of L and the 
equations (4.1) numbered so that the forms 2X Mise; (t=1,---,r—1) are 

je 


linearly independent, this solution is explicitly given by the equations, 


There remains the case that the residual matrix in question does not 
have maximum rank. This can be treated by means of the following theorem. 


THrorEM 4. Let the matrix A of the system (4.1) have rank r equal 
to the number of equations. In the notation of Theorem 3, if s<n and Az, 
the residual matrix with respect to has rank < min (r,n—S), 
let T be a sequence of t integers out of H such that Ar has maximum rank t. 
Then (4.1) has the same minimaz with respect to %,,- - -,%, as the system, 
(4. 34) = io (4==1,- aig 

jeT+G 
has with respect to Ug,,° * *,Ug, (and the latter minimaz can be determined 
by Theorem 3). For each ke H —T there are elements c,; such that 


(4. 35) D = ix 
jeT 


| 
| 
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Each minimax solution X@ of (4.1) can be derived from a corresponding 
minimax solution U@ of (4.34) by assigning arbitrary values to the com- 
ponents X¢; with je H—T and determining the remaining components by 
means of the equations, 
(4. 36) XG, (je@), XG; DY cy X% (jel). 
keH-T 

Proof. For any integer ke H—T the augmented matrix of (4.35) 
(considered as a system of equations in the unknowns c¢,;), being the same 
as a submatrix of Ay except possibly for a permutation of columns, has the 
same rank ¢ as the matrix Ar of the system. Consequently there are elements 
c,j satisfying (4.35). If X is a solution of (4.1) and elements Uj are 
given by 
(4.37) Uj= (jeG@), U;=Xj+ (jeT), 

keH-T 

then u; =U; is a solution of (4.34), since by (4.37), (4.35) 

jeT jeT jeT keH-T jeT keH-T jeH 
Conversely, if wu; U; is a solution of (4.34) and elements XY; for je H —T 
are chosen arbitrarily, then elements X; for je 7+ G are uniquely determined 
by (4.37) and the resulting vector (X,,---,Xn) is a solution of (4.1). 
In particular, if u; =U; is a minimax solution of (4.34), then the vector 
X¢ given by (4.36) with X%; arbitrary for je H —T is a solution of (4.1). 
Moreover, it is a minimax solution; for, if Y is any other solution of (4.1) 
and the elements Uy are determined by (4.37), then, as noted above, u;—=U,; 
is a solution of (4.34) and hence, 


max | X; | = max | U; | = max | U%; | =| max | X; | (jeG@). 
j j j j 


Conversely, X¢ is a minimax solution of (4.1) only if the elements U%; 
determined by (4.36) are the components of a minimax solution U@ of 
(4.34). Finally, the matrix of the system (4.34) has rank r equal to the 
number of equations, since it is obtained from A by deleting columns linearly 
dependent on those occurring in Ay; and the residual matrix with respect 
to Uy,,* * *,Ug, is Az, which has maximum rank by hypothesis. Accordingly, 
(4.34) does indeed have a minimax solution U@ by Theorem 3. Hence, (4. 1) has 
@ minimax solution X¢ and for both systems M¢ = max | X%; | = max | U; | 
(je @). 


5. The minimax of a system of linear functions. The minimax and 
minimax points of a system of linear functions over F, 


| 

f 
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5 
q ° 

j= 


can be determined by applying the theorems of the preceding section to the 


system of linear equations, 


qd qtr 
j=l 


where 8); is the unit element of / when h =j and is the zero element other- 
wise. All solutions of (5.2) are given by 


(5.3) (j=1,°: = (X1,° *,Xq) 
gG=q+1,: 


in which X,,- --,X, are arbitrary elements of F. Consequently, the mini- 
max M*(L) of (5.1) is the same as the minimax of (5.2) with respect 
tO Moreover, (X*,,- - -,X*g) is a minimax point and 
(L*,,- - -,L*,) a minimax vector of (5.1) if and only if (X*1,-- -,X*;, 
L*,,- - -,L*,) is a minimax solution of (5.2) with respect to %i1,* °°, Ugur. 

For convenient expression of the results we introduce some additional 
notations. If the rows of index i,,- - -,7, are deleted from a matrix B, we 
denote the resulting matrix by B**:, If the superscripts are all distinct 
and the latter matrix is square, 6%‘: is its determinant and otherwise is 
zero. Let B be rXq and, if g<r, let T= (t,- - -,%-¢-1) be a sequence 
of r—q—1 distinct integers, 1=1,r. The number d(I) of zeros in 
the sequence (B!,- - -,B!") we call the degree of I with respect to B. 
Clearly, d(1) =r—q—1. If d(I) <r, we call I a proper row tinder 
sequence (p.r.i.s.) for B. 


THEOREM 5. Let the matric B= (bij) of 
the system of linear functions (5.1) have maximum rank, and denote by By 
the matrix obtained from B by adjoining the elements bio as the (q-+-1)-th 
column. Then the system (5.1) has a minimar M*(L). If gr, M*(L)=0 
and the minimax points X* are those for which L,(X*,,- - -,X*q) =0 
(t=1,---,r). Ifq<r, M*(L) =max | p!!, where I ranges over all p.r.i.s. 
for B and yp! is defined by 


pl = | |. 
Let V bea p.r.i.s. such that | | = max | p! |; let R denote the set {1,---,r}; 
I 


U the set of integers te R such that BY'=0, and €(V,1t) the number of 


SOME LINEAR MINIMAX PROBLEMS OVER AN ORDERED FIELD. 559 


integers in V exceeding i. Then the minimax points X* are those which 


satisfy the conditions, 
(5. 4) L,(X*;, (— sgn (te R— U). 
(5.5) | S| | (ieU). 


Proof. For g=r the theorem is immediate. To establish it in the case 
q<r we proceed to determine the minimax M® of (5.2) with respect to 
using the notation of Section 4 with aj;—b, for 7—0, 
and for The rank -of ‘the 
matrix A of (5.2) turns out to be 7, the same as the number of equations ; 
and the residual matrix with respect to these unknowns is B, which has 
maximum rank by hypothesis. Thus, M@ can be determined by the case 
n—r<ssn of Theorem 3. Noting that for a sequence of integers 


= (— 1) 
p=3(r—g) (r +943) tats: te (1, 

= (— 1) 
Pom 4(r—Q—1)(r+q+2) tat: 


and hence, that J is a p.c.i.s. for A containing H = {1,- - -,q} if and only 
if + and J is a p.r.i.s. for B, we have 
py = (—1)?u! for JeTy. Consequently, M@ and hence, also M*(L) is 
ara |»! | as I ranges over the p.r.i.s. for B. Moreover, letting 


we have | wz | = max |p,|. Hence, in the notation of Theorem 3,q-+teD—H 
Jel, 


is equivalent to ie U, and q+ieN—D is equivalent to ic R—U, since 
n,q4;=0 if and only if BY’ 0. Then for any minimax vector X* of (5.1) 
and any minimax solution X¢ of (5.2) it follows from (4.29) that 
X*,, Xo = VEq+ti = PE sgn AEg+i 
(— sgn (ie R— U), 
and from (4.31) that 


| : | | 


(ieU). 


Since (4.30) is then satisfied automatically, the conditions (5.4), (5.5) 
characterize the minimax points X* completely. 
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Corotuary 5. If U=V, t.e. if none of the determinants BY* (1¢ V) 
vanish, the system (5.1) has a unique minimax point X*, namely, the (in 
this case) unique solution of (5.4). 


To determine the minimax of (5.1) when the matrix B does not have 
maximum rank, we employ Theorem 4 and obtain 


THEOREM 6. Let the rank of B be t<min(q,r), and let T be a set 


of t integers out of H = {1,- - -,q} such that the submatrix By has maximum 

rank t. Then (5.1) has the same minimaz as the system, 

(5.6) Dd bijuj — dio (1=1,- 
jez 


with matrix of miximum rank. For each ke H—T there are elements cy; 
such that 


jeT 
Each minimax point X* of (5.1) can be derived from a corresponding 
minimax point U* of (5.6) by assigning arbitrary values to the components 
X*; with je H—T and determining the remaining components by means 
of the equations, 
X*;=U*;,— (eT). 
keH-T 

6. Problems in best approximation. The foregoing results can be 
applied to certain problems of best approximation in the Tchebycheff sense. 
For example, if as an alternative to the usual least squares criterion we 
define a best approximate solution x* of an inconsistent system of linear 


equations, 


q 
> = dio (1=1,- 
j=1 


as a set of values for z which minimizes the largest of the “errors” 
| bio | - -,7), then the vectors are precisely the minimax 
j 


points of the system of linear functions (5.1) and hence, are determined by 
Theorems 5 and 6. 

Next we consider the problem of best approximation to a function at a 
finite number of points. Let S, be a set of p points %,: - -,%») in a com- 
pletely arbitrary space, and let ® be the class of all single-valued functions 
defined on S, with range in the ordered field F. Given a subclass G of ® and 
a function $(z) ¢®, we define a G-function of best approximation to ¢ on Sp 
as a function g*(x)eG which minimizes | g(x) —¢(z)| as g ranges 

weSp 
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| 


SOME LINEAR MINIMAX PROBLEMS OVER AN ORDERED FIELD. 561 


over G. If g:(x),° + *,9q(x) are given functions in ® and we take G as 


the class of functions expressible in the form > cgs(2) with cje Ff’, then® 
g* (2) = where (c*;,- - -,c*,) is a minimax point of the system 
of linear 


and so again is determined by Theorems 5 and 6. The particular cases 
= 2) and gox(x) = cos kz, =sin ke yield the results of De La 
Vallée Poussin [5] on best approximation on sets of p real numbers by poly- 
nomials of degree n=q—1=p—2 and by trigonometric sums of order 
n= (q—1)/2S (p—2)/2 respectively, if we note that in these cases none 
of the determinants /+ vanish so that Corollary 5 is applicable and there is 
a unique minimax point. 

Finally, we determine the G-function of best approximation to ¢ on Sp 
when G@ is the class of all functions ge whose values on S, are subject to a 
set of linear conditions, 


j=l 


in which the c¢;; are given elements of ¥. An important instance is the class 
of all functions in ® whose n-th divided differences on S, have preassigned 
values, [2i, —Cio for 1—=1,---,p—n. Then (6.1) holds 
with p=r-+n, 


k 


for j=1,1+1,---,i+n and c¢j;—0 otherwise,® provided S, is a subset 
of F. For this choice of the class of approximating functions the values g* (2) 
on Sp are given by 


g* = + $(2)) -,p); 

where y= Y* is any minimax solution of the system of linear equations, 
p 
= Cio — Cijh (25) (t=1,---,r), 
j=1 j=1 

and can therefore be obtained by means of Theorems 3 and 4. 

UNIVERSITY OF GEORGIA. 
°This problem has recently been treated for q=p-—1 in the field of complex 


numbers by T. S. Motzkin and J. L. Walsh [8]. 
* See for example [7]. 
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CONVERGENCE IN AREA OF INTEGRAL MEANS.*? 


By CasPER GOFFMAN * 


1/n 
The integral means f,(2z, y) =n? J n=1, 
0 0 


2,---, of a function f(z,y) have been used as a smoothing approximation 
to f(z,y) in a variety of applications. Their use in surface area theory seems 
to have been first made by T. Radé, [1], who showed, for continuous f(z, y), 
that the Lebesgue areas, A(fn), of the surfaces given by fn(z, y), nm =1,2,° - -, 
converge to the Lebesgue area A(f) of the surface given by f(z, y). 

For summable f(z,y), the Lebesgue area is in general meaningless and, 
in previous papers, L. Cesari [2] and the author [3] have introduced, in 
different ways, generalized Lebesgue areas, (f) and #(f), which were proved 
in [3] to coincide for every summable f(z,y). Our main purpose is to show 
that the result of Radé mentioned above concerning the f,(x, y) holds for every 
summable f(z,y) provided the Lebesgue area is replaced by the generalized 
Lebesgue area. Indeed, for every summable f(z, y), the integral means f,(z, y) 
are continuous so that their Lebesgue areas A(f,) do exist. What we show 
is that the sequence {A(f,)} either converges, or diverges to 4-0; i.e., it 
converges on the extended closed interval [0,-+-co], and the limit coincides 


with the generalized Lebesgue area: 


lim A (fn) = —o(f). 


It follows that the limit above can itself be interpreted as a definition of 
“the area of the surface defined by f(z, y).” 

The area H(f) of the paper [3], where we proved that 6;.-—6—H, 
is denoted by x(f) in the present paper. The area ®(f) of the present paper 
was not considered in [3] and we prove that ®(f) = x(f). 

For convenience, we say that a function f(z,y) has property R if the 
sequence {A(f,)} of Lebesgue areas of its integral means converges on the 
interval [0,-+co]. We thus show that every summable f(z,y) has property 


* Received November 3, 1954. 
* Research supported by a grant from the National Science Foundation. 
* We are grateful to the referee for his thoughtful criticism of the first version 
of this paper. 
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R. The property R is defined similarly for representations of parametric 
curves and surfaces. We shall show also that every f:2(t), 1=1,2, 
0<t<1, 2;(t) summable, has property R. For parametric surfaces, the 
situation is difficult, as may be expected. For this case, we extend a result 
of C. B. Morrey [4], [5], [6], but shall omit the proofs since they are 
essentially the same as for the continuous case. 

We first show that property R is general for curves; i.e., if z(t), y(t) 
are summable functions on the closed interval [0,1], with integral means 


1/n 1/n 
=n u)du and y,(t) =n f y(t + u)du, n=1,2,- - -, then 
0 0 


the sequence of lengths of the curves given by t,(t), yn(t) converges on the 
closed interval [0,-+ 0 ]. 
The generalized variation ¢(z) of a summable function x(t), on [0,1], 
was defined in [3]. One of several equivalent definitions is the following: 
For any finite set S:0 << tp) <t, tn <1, let the norm, v(S), be 
the largest of the numbers fo,t;—%,- - -,i—t,. Let 


$(2;8) —> | 2(t:) —2(tra)|- 


Now, let H be the set of points of approximate continuity of x(t), and let 
¢(z) =sup¢(z;8) for all SCH. The following result is a simple con- 
sequence of this definition: 

If (xz) < +0, then for every « >0 there is a 8>0 such that 8 CE 
and v(S) <8 implies ¢(z;8) >¢(xr) —e. If =-+ 0 then for every 
M > 0 there is a 8>0 such that S C F and v(S) <8 implies ¢(z;8) > M. 

The function ¢(x) agrees with the variation V(x) for continuous z(t) 
but not necessarily for discontinuous z(t). However, it is always true that 
<V(z). 

The integral means 


=n + u)du and y,(t) = n + u)du 


are defined on the interval [0,1—1/n], are absolutely continuous, and so 
are differentiable almost everywhere. It follows by the fundamental theorem 
of Lebesgue integration that the derivatives are given almost everywhere by 


(t) =n{x(t+1/n) —2(t)} and y,’(t) =n{y(t+1/n) —y(t)} 


so that, since z,(¢) and y,(t) are absolutely continuous, 


*1-1/ 


$(2n) = V 


/n 1-1/n 
f +1/n) —a(t)| dt 
0 
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and 
1-1/n 
=n | y(t +1/n) — y(t) | dt. 


We show that the sequences {¢(2,)} and {¢(yn)} converge on the closed 
interval [0,-+-co]. It is only necessary to show this for {f(2n) }. 
Let us first prove that (tn) =¢(x). Indeed, we have 


< nf" x(t +1/n) —2(t)| dt 


=n f° +i/n) — x(t + (t—1)/n) dt. 


For almost all ¢, 0S t= 1/n, the points t, ¢-+-1/n,- - -,t+ (n—1)/n are 
all points of approximate continuity for 7(t), and hence, the sum under the 
last integral does not exceed (x). Thus: 


Sn 


Suppose (2) <oo. Let e>0. By the above remark, there is a >0 
such that SC and »(S) <8 imply ¢(z;8) But, for all 
n> 1/8, and all te [0,1/n], the set t,¢-+1/n,- - -,¢+ (n—1)/n has norm 
less than 8 and, for almost all ¢, it is a subset of FE, so that 


| a(t +i/n) —2(t + 
Hence, 


a(t+1/n) —x(t)| dt =¢(an). 


0 
Thus, for every n > 1/8, we have ¢(2) —e < so that } 
converges to d(x). If d(x) =-+ 0, a similar argument shows that {¢(2n) } 
converges to +c. 
Now, for each n, the pair z,(t), yn(t) represents a continuous curve C, 
whose length is given by 


flee + 1/n)—a(t)}? + {y(t + 1/n) 


+ {y(t + i/n) — y(t + dt. 


n-1 
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If, now, Z is the set of points at which both x(t) and y(¢) are approximately 
continuous, and f is the pair x(t), y(t), we may let 


for each S:0 < ly) <t, < ty <1 and then let y(f) —supy(f; 8) for all 
SCE. As for ¢(f), we again have: If y(f) < +0, then for every « >0 
there is a 8 > 0 such that if v(S) <4, and S C L£, then y(f;8) >u(f) —e; 
if y(f) =-+ 0, then for every > 0 there is a 8 > 0 such that if v(S) > 6, 
and C #, then y(f;8) > 

It then follows, just as for ¢(x), that if y(f) < +0 then for every « >0 
there is an N such that n>WN implies y(f) —e< L(C,) <yw(f), and if 
¥(f) +o, for every M>0 there is an N such that n>WN implies 
L(C,) > M. We may now state: 


THEOREM 1. Jf x(t), y(t) are summable functions defined on [0,1], and 
1/n 
C,, is the curve corresponding to the integral means x,(t) =n a(t + wu) du, 
1/n 70 
ya(t)—n y(t + u) du, n=1,2,-- then the sequence {L(C,)} of lengths 
0 


converges on [0,+00]. In other words, every summable x(t), y(t) has 


property R. 


Let f(x,y) be a summable function defined on the unit square 
I=[0,1] x [0,1]. For every natural number n, the integral mean 


1/n 
f.(z,y¥) = f(a@+u,y+v)dudv is defined on the square 
0 0 
I, = [0,1—1/n] X [0,1—1/n]. 


It is continuous, and is absolutely continuous in each variable for almost all 
values of the other variable; moreover, it is well known, and easy to prove, 
that its partial derivatives are given almost everywhere by the expressions 


(1) Ofn (x,y) = n° f "(a+ +) + 0)} dv 


1/n 
(2) +1/n) —f(e+u,y)} du, 
0 
For continuous f(x,y), [5], the derivatives df,/dr, df,/éy are given every- 


where by (1) and (2) and it is obvious that they are continuous. 
It follows easily that 0f,/dx and df,/dy are summable. For: 


i 
i=1 

1 
0 
V 
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1-1/n »1-1/n 
f j | Of n/Ox | dx dy 


0 


»1/n 1-1/n 
<n’? f dv{ [| f(a +1/n,y+v)| + | f(x,y + v)] dx dy} 


1-1/ 
"tnt f | f(a + 1/n,y + v)— f(x,y + v)| dv} dr dy 
0 


0 


e 


0 7 0 «7 0 
1 1 pl 
= 2n? f dv f f | f(x, y)| dx dy = 2n f f | f(x, y)| dx dy, 
e 0 0 0 79/70 


and the result follows since f(x,y) is summable. 

Moreover, since f,(z,y) is absoluately continuous in each variable for 
all values of the other variable, it is ACT and so: 

If f(x,y) is summable, the surfaces given by the integral means 
fn(z,.y), M=1,2,- - -, all have finite Lebesgue areas given by 


1-1/n 1-1/n 1/n 
(83) AG)— fF + +0) + ao}? 


f (f(a-+ u,y + 1/n) — f(x + u, y))du}*]4 dx dy. 


0 
We shall show that the sequence (3) converges on the closed interval 
[0,-+-co]. It seems worth noting that the technique used in proving Theorem 
1 now yields an inequality instead of an equality. Thus, a separate argument 
is needed to prove the opposite inequality. Since f(2,y) is measurable, there 
are subsets H, and EF, of [0,1], each of measure 1, such that for every ze L;, 


ye L,, f(x,y) is approximately continuous almost everywhere in the other 
variable. 

Just as we did in the case of functions of one variable, we first prove 
our result for variation functions ®,(f) and #,(f), which we now define. 
Let (f;x;c¢,d) be the generalized variation of f(z,y) for x fixed and y 
varying on the interval [c,d], and let ¢(f;y;a,6) have similar meaning. 
We note that these functions are essentially additive; i.e. if c<c’ <d and 
f(x.y) is approximately continuous in y at the point (z,c’), then 


+ =$(f;23¢,d). 
We let 


d 
f o(f;x;c,d)dx and ®.(f;a,b;c,d)= b)dy. 


4 


In particular, we write. 


®,(f) =,(f;0,1;0,1) and &,(f) =.(f;0,1;0,1). 
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For every a, 6, c, d, with OSa< 6=1, 0OSc<d=l, the functions 
@(f;2;c,d) and o(f;y;a,6) are “defined almost everywhere, have values in 
the interval [0,-+0], and are measurable, and so the above integrals have 
meaning. 

The function f(z,y) is of generalized bounded variation if ®,(f) and 
#,(f) are both finite; otherwise, it is of generalized unbounded variation. 
For continuous f(z,y), our definition agrees with that of bounded variation 
in the sense of Tonelli, and in the general case, agrees with Cesari’s definition 
[2] of a function of type gBVT (or of generalized variation in the sense of 
Tonelli). 

Since, for every n, the integral mean f,(z,y) is ACT, it follows that 


1-1/n 1-1/n 
®2(fn) f f | Of, /Oy | dz dy 
0 0 


1-1/n 1-1/n 
and 2(fn) | Ofn/dx | dx dy. 
0 0 


We retain the convention of convergence on the closed interval [0, + ], 


and we prove 


THEOREM 2. For every summable 9), the integral means f,(2, y) 
satisfy 
lim ®,(fn) =®,(f) and o(fn) = ®.(f). 


n> 


Proof. We prove the theorem for ®.(f). Since 


1/n 
(f(a+ 1/n,y +v) —f(a,y +0) }dv 


almost everywhere, it follows that 


ff 


1-1/n 1-1/n 
0 70 0 


1/n 
{f(z + 1/n,y + v)—f(x,y + v)}dv | dx dy 
0 


1-1/n 1/n n-1 


-1/nn-1 


v+1-1/n 1/n 
=n? “aw f dt >| f(x +k/n, t) — f(x + (k —1)/n, t)| dz. 


For every v and for all te #,-[v,v+1—1/n], that is for almost all 
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te[v,v-+1—1/n], we have: for almost all z, the points (x, t), (x +1/n, t), 
--+,(a+(n—1/n,t) are all points of approximate continuity for f(z, y) 
with respect to z, and hence: 


Then for the same v and ¢t we have 


J fe +4/n, ) + (k—2)/n,t)| S90), 


and finally 


1/n v+1-1/n 
f "dv f dt-n-*4(f, t) 
0 v 


v1 1/n 1 
=n? dt dv: n-*9(f, t) = t)dt = ®,(f). 
Jo 0 0 
We have thus shown that #.(f,) =®2.(f), for every n, so that 
lim sup ®,(f,) S ®2(f). 


It remains for us to show that liminf®.(f,) 2@.(f). For this, we 


n> 


need the well known fact that f,(x,y) converges almost everywhere to f(z, y) 
so that it converges almost everywhere in each variable for almost all values 
of the other variable. 

We suppose that ®.(f) <<-+ and let e >0. We then have ¢(f;y) <<-+ 
for almost all values of y, and for every such y there is b(y) <1 for which 


$(f3y30,b(y)) > o(fsy) —€/2, 
where $(f;y) =¢(f3;y;0,1). It follows that there is a b <1 such that, 


a 
f, (f3y;0,b)dy > ®.(f) —«/2. 


Moreover, there is a <1 such that 


> &,(f) —€/2. 


Let n, be a positive integer such that ny" < min (1—a,1—b). It is easy 
to see, and proved in [3], that the generalized variation ¢(f) of a function 
f(x) of one variable is lower semicontinuous with respect to almost every- 
Where convergence. It follows that, for almost all values of y, we have 


lim inf = (f3y;0,b). 


n> 


> | + k/n, t)— fle + (k—1)/n, t)| S 
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Hence, for almost every y, there is an n(y) such that n > n(y) implies 
$(f3y;0,b) > o(f3y;0,b) —e/2. 


There is then a single NV, which may be taken so that N > m, such that for 
n>WN, 


1-1/n : 1/n 
y 39, b)dy = 3430, b)dy —e/2 
0 0 


= f, 3430, b)dy —«/2= ®,(f) —e. 
0 


This completes the proof for the case ®.(f)<-+ The case ®.(f) = 
provides no additional difficulties, so that Theorem 2 is proved. 
Now, for every n, we have the inequalities 


®1(fn) + 22(fn) +1ZA(fn) = max [®. (fn), |, 


where A(f,) is the Lebesgue area of the integral mean f,(z,y). It follows 
that, if f(z,y) is of generalized unbounded variation, the sequence {A(f,)} 
converges to +0, so that f(z,y) has the property R. 

To show that {A(f,)} converges when f(z,y) is of generalized bounded 
variation, we consider the generalized Gedcze expression W(f). A subdivision 
p of J into rectangles R,,R.,- --,Rm by means of lines parallel to the 
coordinate axes is admissible if f(x,y) is approximately continuous in @ for 
almost all y on all boundary lines of the rectangles of the subdivision parallel 
to the y axis, and in y for almost all z on all boundary lines parallel to the 
x axis. The meaning of the notation ,(f;R;), ©.(f;R:), +=1,2,- -,m, 
should be clear, and it follows from the essential additivity of the generalized 
variation function ¢(f;a,6) that 


#,(f) and #.(f) 


We then consider the expression 


i= i= 
where | R;| is the area of R;, and then let 
v(f) =sup¥(f;p) for all admissible subdivisions p. 


Since ¥(f) =4,(f) + @.(f) +1, it follows that ¥(f) <-+ 00. For con- 
tinuous surfaces, W(f) is the Gedcze area so that, in particular, for the 
integral means of our summable f(z,y), we have ¥(f,) =A(fn). 


™ m 


— 


vj 


i- 


571 


CONVERGENCE IN AREA OF INTEGRAL MEANS. 


Again using the fact that if f(2) is of bounded generalized variation, 
then for every « > 0 there is a 8 >0 such that if S C # and v(S) <8, then 
o(f;S) <(f) —«, where E is the set of points of approximate continuity 
of f(x), we obtain an alternate definition for (f). Indeed, as we shall 
prove, ¥(f) = x(f), where 


x(f) =supx(f3p) for all admissible subdivisions p, 


and 


x(f3 p) {xo(f3 Bi) + | Bi 


where 
f | f(bi, y) — f(a y)| dy 


Cé 
and Ri) = f | f(a. — «)| da, 


the rectangle being given by a 


Analogously, if 
xi (f3e) = (fs Ri), x2(f3e) = Bi), 
let 
xi(f) =supyxi(fsp),  —-x2(f) = sup xe(f 3p) 


for all admissible subdivisions. Obviously, we have 
xi(f) t—1,2, x(f) S¥(f). 


We show that ¥(f) —lim¥(f,) =limA(f,). We first let and 


n> n—> 


let p be an admissible subdivision R,, R2,- - -,R» for which V(f;p) > ¥(f) —e. 
By Theorem 2, 


lim inf ©, (f,;R;) = ®,(f;R;) and liminf®.(f,;R;) = &.(f;R;), 


n> 
for every i=1,2,:--,m, so that liminf(f,;R;) =W(f;R;). It follows 
n> 
that liminf ¥(f,;p) =V(f3;p) =W(f)—e and, since «>0 is arbitrary. 
lim inf A )2v(f). 
n> 
Conversely, lim sup A (fn) = x(f). In showing this, we use the expression 


n> 


(3) for A(fn). Then 


| 
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A(fn) = 4 {n? f, + 1/n,y + v)—f(z,y + v))dv}? 
{nt uy + 1/n)— + y))du}*] de dy 


“de dyS + (e+ 4 0) 


,, G—1) 


—f(e+ 


+f 


But, for almost all (2, y) 1—1/n| X 1—1/n], the subdivision 
obtained by means of the lines 


is admissible, so that 


G—1) 


n 


y+ + v))dv}? 


It then follows, since A(f,) is the mean value of the left hand side of 
this last inequality, that A(f,) Sx(f), for every n=1,2,--- so that 
lim sup A(f,) = x(f). Thus we have 


lim sup A (fn) Sx(f) S¥(f) Slim inf A(f,), 
and hence 
lim A (fa) =x(f) = ¥(f) 
Analogously, we can prove that y,(f) =®,(f). yo(f) =®.(f). This 
completes the proof of 


THEOREM 3. Every summable function f(z,y) has the property R. 


According to L. Cesari [2], the function f(z,y) is said to be of type 
gBVT, or of generalized bounded variation in the sense of Tonelli, if when a 


of 
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certain set of measure zero is neglected, the function is of bounded variation 
with respect to each variable on the remaining set for almost all values of 
the other variable, and these modified variation functions are both summable. 
As proved in [3], the summable function f(z,y) is gBVT if and only if 
xi (f), x2(f) are both finite. The considerations above prove also that f(z, y) 
is gBVT if and only if ®,(f) and ®,(f) are both finite. 

We turn now to parametric surfaces. C. B. Morrey [4] obtained results 
regarding the integral means for a parametric representation f:2;(u,v), 
i=1,2,3; perhaps his main result is that if the z(u,v) are all ACT and 
if each pair of partial derivatives 02,/0u, 02;/0v appearing as a product in 
one of the jacobians belongs to conjugate L, spaces, then the jacobians of 
the integral means 2;"(u,v) of x;(u,v) converge almost everywhere, and in 
L, to those of z(u,v). In particular, it follows that f has property R. The 
following fact is important in this connection: If f(z,y) is ACT and if 
is the n-th integral mean of f(x,y), then df,(2, y)/dx and y)/dy 
are the n-th integral means of Of(x,y)/dx and Of(x,y)/dy almost everywhere. 
We observe that this holds for f(z,y) of type gACT, but the proof is not 
very different from the one for the continuous case, and so it will not be 
given. The definition of a gACT function [3] is the same as that of an ACT 
function except that the continuity restriction is waived. 


The following extension of Morrey’s theorem holds. 


THEOREM A. A mapping f:2;(u,v), 1=1,2,3, has property R tf the 
z(u,v) are gACT and one of the following conditions is satisfied: 


a) All partial derivatives dx;/du, 0x;/0v, i=1, 2,3, are in Lp. 


b) At least two of the x;(u,v) have bounded partial derivatives and 
the others are integrable. 


c) Hach pair 0x;/du, 0x;/0v appearing as a product in one of the 
jacobians belongs to conjugate Lp spaces. 


It is understood that sets of measure zero may be neglected in calculating 
partial derivatives appearing in this theorem. The proofs are then much 
like the continuous case, and are omitted. 

In conclusion, we wish to point out that there are mappings which do 
not satisfy any of the above conditions and still have property R. For example, 
we consider the Gedcze mapping f:2;(u,v), i=1,2,3. Here, let f, (2), 
be a continuous mapping on [0,1] whose image covers the unit square; and 
let 7,(u.v) =f; (u). =f.,(u). 7,(u.v) =0. The functions 7, (u.r), 


Ly 
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z(u,v) are not of type gACT. We observe that the integral means 2,"(u, v) 


are zero so that 
*1-1/n e1-1/n 
A (fn) J | £2") /O(u, v) | du dv. 
0 0 


But 2;(u,v) =2(u+h,v), i=1,2, for every u,v,h, so that 


~1/n 21/n 
J dédy= J a(uth +év+7) dé dn, 
0 0 0 0 


and 2;"(uth,v) =2;"(u,v). It follows that dr;"/du=0, i=1,2; 


2,-- wand v arbitrary. Thus A(f,) —0, so that f has property R. 
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ON THE RAMIFICATION OF ALGEBRAIC FUNCTIONS.* * 


By SHREERAM ABHYANKAR. 


Introduction. The investigations of the present paper arose from our 
(unsuccessful) attempt to adapt the ideas of Jung’s classical proof of the 
theorem of local uniformization on algebraic surfaces over the field of com- 
plex numbers ? to the solution of the local uniformization problem for algebraic 
surfaces over fields of nonzero characteristic.* To explain matters, let us 
consider the following situation. Let 7 be a rational transformation from an 
r-dimensional normal algebraic variety V onto an r-dimensional nonsingular 
algebraic variety W such that T and 7-* are free from fundamental points. 
Let the ground field & be algebraically closed of characteristic p. Let P and 
Q be corresponding points on V and W respectively and let B be the branch 
locus of 7-t on W. Let # and F be the quotient fields of the completions 
of the quotient rings of P and Q on V and W respectively. Then F can be 
canonically considered to be a finite separable algebraic extension of Ff. Let 
E* be a least Galois extension of F containing #. Let us denote by G(P/Q) 
the local Galois group of P over Q, i.e., the Galois group of L* over F. 

The basic facts underlying Jung’s proof, then, are that for p—0O and 
r= 2 we have: (1) If Q is a simple point of B then G(P/Q) is cyclic and 
P is simple for V. (2) If Y is an ordinary double point of B then G(P/Q) 
is a direct product of two cyclic groups. We shall give algebraic proofs of (1) 
and (2) in Section 2, and shall show in Section 3, by examples, that these 
statements are no longer true if p40; namely we shall show that P may be 
a singular point of V even if Q is a simple point of B, and that the local 
Galois groups above simple or ordinary double points of B can be very com- 
plicated and even insolvable. The collapse of statements (1) and (2) is the 
reason for the nonadaptability of Jung’s method of uniformization to the case 


* Received April 6, 1955. 

*The material of this paper together with paper [1] forms the author’s Ph. D. 
thesis at Harvard University. The work was supported by a research project at 
Harvard University sponsored by the Office of Ordnance Research, U. S. Army, under 
Contract DA-19-020-ORD-3100. 


* See [4]. For a simplified version of Jung’s proof see [10]. 
* We have proved the theorem of local uniformization on algebraic surfaces over 
fields of nonzero characteristic in [1]. 
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of ground fields of nonzero characteristic. An algebraic treatment of the 
main parts of Jung’s proof will be given in Section 4. 

Let now pO and assume that B has a ¢-fold normal crossing at Q 
with t=r (for definition see Section 3). Let +(P/Q) be the smallest sub- 
group of G(P/Q) containing all the p-Sylow subgroups of G(P/G). Then, 
as will be proved in Section 2, the factor group G(P/Q)/m(P/Q) is a direct 
product of ¢ cyclic groups. This characterization of local Galois groups above 
normal crosings is the main result of this paper. Also we shall include in 
Section 1, Zariski’s own proof of his theorem about the purity of the branch 


locus. 
My warmest thanks are due to Professor O. Zariski for his kind encourage- 


ment and valuable advice. 


Notations. We shall use the following notations. For a valuation v 
of a field Z we shall denote by FR, the valuation ring of v and by M, the 
maximal ideal in R,. We shall say that an integral domain A is normal, if 
A is integrally closed in its quotient field. If R is a local ring and J its 
maximal ideal, we shall express this by saying, “(R,™M) is a local ring.” 
If RF is the quotient ring of an irreducible subvariety of an algebraic variety, 
we shall say that FR is algebraic. If W is an irreducible subvariety of an 
algebraic variety V, we shall denote by Q(W,V) the quotient ring of W on 
V and by M(W,V) the maximal ideal in Q(W,V). When the reference to 
the embedding variety V is clear we shall simply write Q(W) and M(W) 
for Q(W,V) and W(W,V) respectively. Unless otherwise stated, by a point 
we shall mean a rational point over the ground field under consideration. 


1. The branch locus. The basic definitions and some preliminary results 


needed in the present paper were given by us in Section 2 of [1]. In this 


section we add a few more preliminary results. 


In the following two lemmas we collect some well known facts about 
derived normal models. For proof of these lemmas we refer to Zariski’s , 
papers: see the remarks on pages 68-70 of [13] and the discussion of the ; 
birational case on pages 506-511 of [11]. 

LemMA 1. Let K be an r-dimensional algebraic function field over an ; 
algebraically closed ground field k, and let K* be a finite separable extension ' 
of K. Let V and V* be normal projective models respectively of K/k and ‘ 
K*/k such that the rational transformation T from V* onto V (determined ‘ 


by the embedding K C K* of the function fields of V and V*) and its inverse 
Then: (a) To an irreducible 


map T-* are free from fundamental points. 
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subvariety W* of V* corresponds a unique irreducible subvariety W of V. 
Furthermore dim W* = dim W and Q(W*) D7 Q(W). (b) To an trreducible 
subvariety W of V correspond a finite number of irreducible subvarieties 
W*,, W*.,- - -,W*, of V*. Let (R,M) and (R*;, M*;) be the quotient rings 
of W*; and W respectively. Then R*,, R*.,- - -,R*, are exactly the local 
rings in K* lying above R. 

Lemma 2. Given K/k, K* and V as in Lemma 1, there eatsts V* as 
in Lemma 1. Furthermore, V* is unique up to a biregular transformation. 


DEFINITION 1. We call V* a derived normal model of V in K*/k. 
Now let the notation be as in part (b) of Lemma 1. For a fixed index a, 
let E and E*, be the quotient fields of the completions of R and R*, 
respectively; and embed E canonically in E*,. Recall from Section 2 of [1] 
that: 


d(M*,:M) g(M*a:M) = [(R*./M*.) : (R/M) Jo, 
i(M*,:M) = r(M*a: M) = d(M*,: M)g(M*,: M)>, 


7(M*,: M) =r(M*,: M)i(M*,: M)". 
Now we define: 


d(W*,: W) =degree of W*, over W=d(M*,:M); 

g(W*,: W) =separable residue degree of W*, over W=g(M*,:M); 
i(W*,: W) =inseparability index of W*, over W=i(M*,:M); 
r(W*,: W) =ramification index of W*, over W=r(M*,:M); 
7(W*,: W) = reduced ramification index of W*, over W =7(M*,:M). 


We shall say that W*, is ramified over W—or that W*, is ramified for the 
transformation T (or T-t)—1f M*, is ramified over M. Also we shall say that 
W is a branch variety of V for the transformation T (or T-*) if M is a branch 
ideal in R for the extension K C K*. 


Then W is a branch variety of V for the transformation T (or T-?) if 
r(W*,;:W) >1 for some i or equivalently if (W*;: W) < By... By 
i=1 


the branch locus of T or T-1, denoted by B(7’) or B(T-*) we shall mean 
the set of all (rational) points P of V/K such that P is on some irreducible 
subvariety of V/k which is a branch subvariety for T. 


Lemma 3. Let K/k, K*, V, V* and T be as in Lemma 1. Then an 


4 
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irreducible subvariety W of V is a branch subvariety if and only if W C B(T). 
Furthermore, B(T) is a (proper) subvariety of V. 


Proof. Let A be an affine coordinate ring of V and let A* be the integral 
closure of A in K*. Let us denote by D(A*/A) the ideal in A generated 
by all the discriminants d(w,, Wn) over K of K-bases (wi, Wn) 
of K* which belong to A*. Let U C W be two irreducible proper subvarieties 
of V which are at finite distance for A. Then W is a branch variety of T if 
and only if D(A*/A)C M(W) MA; see [7]. Therefore, if W is a branch 
variety then D(A*/A) CM(W)NACM(U) NA and hence U is a branch 
variety. Again, if W is not a branch variety then D(A*/A) C M(W) NA 
and hence for some maximal ideal N in A containing M(W) MA we must 
have: D(A*/A) CN (Hilbert Nullstellensatz). This proves our first asser- 
tion. To prove the second assertion let A be any affine coordinate ring of V 
and let A* be the integral closure of A in K*. Since K*/K is separable, 
D(A*/A) 0 and hence the set of all points of B(T) which are at finite 
distance for A, is a proper subvariety of the affine part of V given by A; i.e. 
“B(T) is a set on a pure r-dimensional variety V/k such that if H is any 
hyperplane defined over k& in the embedding projective space S»,/k of V then 
B(T) A (Sm—H) is a proper subvariety in the affine space 8,,—H.” By 
induction on 7, one can easily show that the assumption between quotation 
marks implies that B(T) is a proper subvariety of V. Our second assertion 
can be proved also by using the concept and simple properties of symmetric 
products developed by Zariski in [14]. The proof, then, can be given as 
follows: Let P be a general point of V/k and let P;,P2,---,Pn be the n 
points which correspond to P for T-' (taking coordinates in some fixed 
universal domain. Let P—P,oP,0---oP, be the symmetric product of 
Fala’ a, ket T be the irreducible algebraic correspondence with a 
general point pair (P,P) over k. Since T is a finitely valued rational 


transformation without fundamental points, the same is true for 7; (T is in 
fact birational). Hence for any point Q (with coordinates in a universal 
domain) of V there corresponds a unique point 0 =(@,°Q.°0- + -o@, under 
T and hence the singular locus S of T is exactly the set of points on V to 
which correspond less than n points on V* for T-1. Therefore B(T') coincides 


with the set of rational points of S (since k is algebraically closed), and hence 
B(T) is a proper subvariety of V. 


DEFINITION 2. Let v be a real discrete valuation of a field K, K* a 
finite separable extension of K, and v* a K*-extension of v. Let M=M, 
and M*—M,.. We define: 
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*:v) =degree of v* over v=d(M*:M); 

= inseparability index of v* over v=1(M*:M) ; 

*:v) =separable residue degree of v* over v= g(M*:M); 
*:v) =ramification index of v* over v=r(M*:M); 

= reduced ramification index of v* over v=F(M*:M). 


Remark. In the notation of Lemma 3, let U be an irreducible r—1 
dimensional subvariety of V and U*,,U*.,- - -,U*, the corresponding sub- 
varieties of V*. Let wu and u*,,u*.,: - -,u*, be the real discrete valuations 
of K and K* respectively such that Q(U) =R, and Q(U*;) =Ry,. We 
observe that *(w*;:w) reduced ramification index (in our:sense) of u*; 
over w= the usual ramification index (in the sense of valuation theory) of 
u*; over u. One might be tempted to define: U is a branch variety if and 
only if F(u*;:w) > 1 for some j. One can easily see that with this definition 
the above Lemma is false, for then U may consist entirely of branch points 
without itself being a branch variety—this will certainly be so if 7(u*;:u) =1 
for all 7 and i(u*;:u) >1 for some 7; see Example 1 of Section 3 (the same 
example also shows that with this definition Theorem 1, which we shall prove 
in a moment, is false). Therefore our definition of a branch variety is the 
right one. Of course for characteristic zero the two definitions coincide. 


Lemma 4. Let K/k and V be as in Lemma 1. Let W a proper 
wreducible subvariety of V, and P a simple point of V which lies on W. 
Let (o,m) and (R,M) be the respective local rings of W and P on V. Let 
+, 2, be regular parameters in R. Then +, 2, are separating 
coordinates for W, t.¢., the set satisfies the following four 
conditions: (1) is a separating transcendence basis of K/k., 
(2) Co, (8) +,2,-] contains a basis of m, and 
(4) o/m is a finite separable algebraic extension of 


Proof. Let (R, 1) be the completion of (R,M). Since FR is regular 
and since it contains a representative k of its residue field, it follows as in 
Section 5 of [6] that we can indentify R with k[[2,,22,- --,2,]], and 
we can consider K as a subfield of k((a,22,- + -,2,)). By Prop. 1.5 of 
[4], and are linearly disjoint over 
and hence K/k(2,,22,- is separable.‘ 


*p denotes the characteristic of k. We are assuming p ~0; for in case p=0O, 
K/k(a,,@,° is trivially separable. 
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Let N= Ri=Sy, and 
M,=—WNR,. Let R, be the integral closure of R, in K. Then as in Proposi- 
tion 2 of [1], R=(Ri)pny. Let A=SOm, 0,=Sa4, and m= Ao, 
Let 6, be the integral closure of 0, in K. Then o= (6,);,,,. By part (III), 
Lemma 4 of [1], r(/:M,) =1 Therefore by part (a) of Lemma 6 of [1], 
r(m:m,)=1. This shows that - -,2,) is a separating coordinate 


system of W. 
The following theorem and its proof are due to Professor Zariski and 
were communicated to me personally. The proof is based on Zariski’s 


paper [12]. 


THEOREM 1. Let K/k, K*, V, V* and T be as in Lemma 1. Let 
B=B(T) be the branch locus of T on V and let B*=T"{B}. Let W be 
a simple irreducible subvariety of V and W* a corresponding subvariety of V*. 
(a) If W* is ramified over W, then there exists an r—1 dimensional 
irreducible subvariety U* of V* containing W* such that U* is ramified 
over its corresponding subvariety U of V. (b) The components of B* and B 
passing respectively through W* and W are pure r—1 dimensional. (c) If 
B(T) contains no singular points of V, in particular if V is nonsingular, 
then B(T) is pure r—1 dimensional. 


Proof. First observe that (b) and (c) follow at once from (a) and 
that in the proof of (a) we may, without loss of generality, let W* and W 
be points (rational over k) P* and P respectively. Now let (R*,M*) and 
M) be the respective local rings of P* and P, and let (2, be 
regular parameters in R. Let (y:,y2,- > -;Ym) be an affine general point 
of V*/k for which P* is at finite distance. Let A—k[y,, y2,° 
where Ymi==2 for 1=1,2,---,r. Then and hence after 
replacing V* by the variety having (41, ¥2,- * °;Ym+r) a8 a general point over 
k we may assume that (4;,42,° - *,Ymsr) is a general point of V*/k. Let 
L=k(2,,%2,- + +,2,) and let v* be the zero dimensional irreducible variety 
in the projective space S,,/Z with general point (41,42, *,Ym) over L. 
Then Q(v*, Sn/L) = Q(V*, Smir/k) Now v* is a simple point of S,,/L 
and hence M(V*,S,,/L) has a basis (f1,f2, **,fm) of m elements. By 
multiplying by a suitable unit in Q(V*,S»/L) we may assume that 
for 1=1,2,---,m. By Lemma 4, K*/L is separable and hence J 0 at 
Yi, =y where J =0(F,, ¥m)° 


5 See Section 2.2 of [12]. 
* See the Criterion for Uniformizing Parameters on page 27 of [12]. 
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Suppose, if possible, that J 40 at P*. Then 


at P*, where for t—=1,2,---,r. Therefore F1,F2,° Fur 
are uniformizing parameters in Q(P*,Smir/k).® Let 


Since R* = Q(P*,Smir/k)/E and since Fie for 
m+ +r, we conclude that M* = (2, 2%2,: -,2,)R* i.e. P* is unramified over 
P, which is a contradiction. Therefore J 0 at P*, i.e. J*e M* where J* 
is the value of J at Yo=Yo,° Ymer = Ymir- 

Let N* be an isolated prime ideal of J*R* in R*. Since J*R* is 
principal, V* is a minimal prime ideal in R* and hence defines an r—1 
dimensional subvariety U* of V* through P*.7 Let N=N*N RFR and UV the 
subvariety defined by N. Let S=k[a,,%2, --,2,], D=SON. Since k is 
algebraically closed, Sp/DSp is separably generated over k. By Maclane’s 
theorem (8.2 of [12]) we can choose a separating basis from the residues of 


say for instance 22, 2%3,- do this job. Fix 

such that D=—G(a,a,:--,a7,)8. As in the proof of Lemma 4, 
N=G(a,%,:--+,%,)R and Ry/(NRy) is a_ separable extension of 
k(x2,%3,° * *,@,). Suppose, if possible, that N* is unramified over JN. 
Then V* = +, 2,)R* and R*y+/(N*R*y+) is a separable extension 


in Q(U*, Snir/k) 5 (see corollary on page 9 of [12]). Therefore considering 
G as a point in (a1, we have: 

0G 
at Y, = Therefore J ~0 at Y;=—y,, Y2— yo, 


»Ynsr =Ymir, Which is a contradiction. Therefore U* is ramified 
over U8 


Proposition 1. Let (R,M) be a normal algebraic local ring with . 
quotient field K. Let K’ be a finite separable extension of K and let K* 
be a least Galois extension of K containing K’. Then M is a branch ideal 
(in R) for the extension K’/K if and only if it is so for the extension K*/K. 


* R* being normal all the primes of J*R* are minimal. 
*J =0 is a local equation of the component of B* through P*. 
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Proof. If M is not a branch ideal for K*/K then by Lemma 4 of [1] 
it follows that WM is not a branch ideal for K’/K. Now assume that M is 
not a branch ideal for K’/K. Let (R*;, M*;) for 1,2,- - -,¢ be the local 
rings in K* lying above R. Let G be the Galois group of K*/K, G’ the 
Galois group of K*/K’, and G; the inertia greup of M; over M. Then by 
Lemma 5.V.1 of [1], G’D G; for 7 =1,2,---,t. Let @ be the smallest 
subgroup of G which contains G,, G2,- - -,@,. Then @ is a normal subgroup 
of Gand GCG’. Therefore the fixed field K of G is a Galois extension of 
K containing K’, and hence K—K*, i.e, G—=(1). Therefore G,—G, 
—=-:-+:-=—(4;— (1), and hence by Lemma 5.1.3 of [1], r(M*;:M) =1 for 
j=1,2,---,¢; i.e, MZ is not a branch ideal for K*/K. 


2. Local Galois groups. 

Derinition 3. Let G be a finite group, p a prime number and s an 
integer. Let a be the smallest subgroup of G which contains all the Sylow 
p-groups of G. If G ts an abelian group which can be expressed as a direct 
product of t cyclic subgroups with ts, then we shall say that G ts a 
0,-group. If G/x is a 0s-group, we shall say that G is a ps-group. 


Derinition 4. Let (R, MM) be the local ring of a simple point P of 
an r dimensional algebraic variety V, and let W be a pure r—1 dimensional 
subvariety of V. Let W,,W.,---+,Ws be the irreducible components of W 
which pass through P. Let Nix=ROM(Wi,V). Let s be an integer with 
sr. Then we shall say that W has an s-fold normal crossing at P if 
there exists a minimal basis of M such that WN, for 


Lemma 5. Let H=k((X1, X2,° and 
E* the root field of f(Z) =Z"—X over E, where in case of nonzero charac- 
teristic we assume that n is not divisible by the characteristic. Let R* be 
the integral closure of R in E*, and let X* be a root of f(Z) in E*. 
n-1 

Then R* => X*iR, R* is a regular local ring of dimension r and 
i=0 

(X*, X2,X3,- + -,X,) are regular parameters in R*. 

Proof. Let v be the valuation of F such that R, = R,x,z), and let v* 
be a #*-extension of v. Since nv*(X*) —v(X;), v* is the only extension 
of v to E*. Since Y,"* is the discriminant of f(Z), we have that: 

n-1 
R*¥ => (X*1X,*")R, (see page of [9]). u*eR* implies that 

i=0 

n-1 


u* = with ue R, and v*(u*) =0. The v* values of any two 
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different nonzero terms in the above sum are distinct. Hence u;340 implies 


that v*(V*‘X,?-"u;) = 0 and hence that XY,?-"ujeR. Therefore R* = > X**R. 
4=0 
The remaining part of the lemma follows at once from this. 


In the rest of this section we shall deal with the following situation. 
Let K be an r dimensional function field over an algebraically closed ground 
field & of characteristic p and let K* be a finite separable extension of K. 
Let V be a projective normal model of K/k, V* a derived normal model 
of V in K*, and T the rational transformation from V* onto V. Let P 
and P* be corresponding points of V and V*, and let (R, M) and (R*, M*) 
be the local rings of P and P* respectively. Let Q be an algebraic closure 
of K*. 


LemMA 6. Assume K*/K is Galois and let be a finite 
number of elements in R such that N;—=y,R 1s a minimal prime ideal in R 
and hence that Ry, is the valuation ring a real discrete valuation w; of K. 
Let n; be the reduced ramification index over w; of a K*-extension of w;.° 
Assume that if then ny40(p) for j= 1,2,---,h. Let yp/™ be an 
n-th root of y;inQ. Let 


Let be a minimal prime of R different from N,, - and let 
be the valuation of K with valuation ring Ry,. Let w*; be an extension of 
w, to K*, w*; the K*-restriction of 0*;, and @; the K-restriction of w*;. 
Then: (a) K*/K, K/K, K*/K* and K*/K are Galois extensions. (b) 
w*;) =F (O*;: w*;) =1 and r( 0; = w;) =F w;) for 7—1, 2, 

‘she Wo) =1, 1(W*o: Do) Wo), and 


5: = F(w*o: Wo). 


Proof. We shall assume that h =1, since the general case follows from 
this by a straightforward induction. (a) follows from the fact that the 
compositum of two Galois extensions of K is again a Galois extension of K. 
Now let [K*:K*] =n. Then n,=0(n). Let r=y,1/™ and z=a". Then 
f(¥) =X"—z is the minimal monic polynomial of x over K*. Now 2"? 
is the discriminant of f(X). Since z" is a unit in Rys,, r(*):w*)) =1. 
Fix we Ry», such that wRy+,—=My+,. Then 2/u is a primitive element of 
K*/K*, and g(X) =X"—zu™ is the minimal monic polynomial of 2/u 
over A*, Since w*,(zu-") = 0, the w*,-residue e of zu is a nonzero element. 


Hence the discriminant of ¥"—e is also a nonzero element of Rys,/My»,. 


* Since K*/K is Galois, n, depends only on w,. 
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Therefore the discriminant of g(X) is a unit in Rys,, and hence r(*,: w*;) = 1. 
Since n,,(x) we have that r(@,:w1) =7(@,:w,) =m. The rest 
of the assertions follow in a similar fashion. 


Lemma 7%. Assume that P is simple; P* is the only point which corre- 
sponds to P; Z*—2, is the minimal equation of a primitive element x* of 
K*/K; [K*:K]40 if p¥0, and that x, is part of a minimal basis 
(x1, of M. Then P* is simple and M* = (2*, +, 


Proof. Follows from Proposition 1 of [1] and Lemma 5. 


TueoreM 2. Let B be the branch locus on V. Assume that P 1s simple 
and that the component B of B, which is ramified for the extension P— P*, 
has a t-fold normal crossing at P.1° Then the local Galois group G(P*/P) 
is @ pr-group. 


Proof. Let K’ be the least Galois extension of K such that K* C K’ CQ, 
P’ a point corresponding to P* on a derived normal model of V* in K’, and 
(R’, M’) the local ring of P’. Let K, and K*, be the splitting fields of P’ 
over P and P* respectively, and let K’ be the least Galois extension of K, 
such that K*,C K’ CK’. Let P,, P*,, P’ be the points corresponding to P’ on 
derived normal models V,, V*,, V’ of V in K,, K*,, K’ respectively. The 
results of Section 1 and of Section 2 of [1] imply the following: P, is simple; 
if B, is the image of B for the transformation V > V,, then B, is the branch 
variety for the extension P— P’, B, has a t-fold normal crossing at P,; and 
G(P’/P,;) =G(P*/P). Therefore we may replace V, P, V*, P* by Vs, Ps 
V’, P’ respectively. 

Hence to start with we may assume that P* is the only point corre- 
sponding to P and that K*/K is Galois. Let G be the Galois group of K*/K 
and let « be the smallest subgroup of G which contains all the p-Sylow groups 
of G if p40 and let x —(1) if p—0. Let K be the fixed field of +, Va 
derived normal model of V in K and P the point in V corresponding to P*. 
Then the branch locus B for the extension P—>P is a subvariety of B and 
hence B has a m-fold normal crossing at P with mt. We have to show 
that G/z is a 0;-group, and hence enough to show that G/z is a Om-group. 

Therefore to start with we may assume that if p40 then [K*: K] 0(p); 


and then we have to show that @ is a 0;-group. Let (21,22, - -,2,) be regular 
parameters in such that —=—Q(B;,V) for 7=1,2,---,¢ where 


10 By Theorem 1, B is pure r— 1 dimensional. B consists of those irreducible com- 
ponents B, of B passing through P for which there exists a corresponding variety B”*, 
through P* with r(B*,: B,) >1. 
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B,, By are the irreducible components of B. Let B*; be an irreducible 
subvariety of V* corresponding to B;, and let nj==r(B*;:B;).% Let aj/™ 
be an root of a; in QO. Let and 
K,=K Let be a point corresponding to P* on 
a derived normal model V, of V in K, and let P, be the point corresponding 
to P. on a derived normal model V, of V in K,. Let (R,,M,) be the 
quotient ring of P,. Theorem 1 and Lemmas 6 and 7 imply that P, is a 
=(7(P,:P) =the direct product of ¢ cyclic groups of orders m,M2,- - 
Since G(P*:P) is a subgroup of G(P.2:P), we conclude that G(P*:P) 
is 0,-group. 


THEOREM 3.'* Let the assumptions be as in Theorem 2 and let p=0. 
Let (#:,%2,° *,%,) be regular parameters in R such that = Q (Bj, V) 
for 7=1,2,---,¢; where B,,Bo,- are the irreducible components of 
B. Let R=k[[a,,22,- + -,2,]] and R* be the completions of R and R* 
respectively; and embed R in R* canonically. Let E and E* be the respective 
quotient fields of Rand R*. Then P* is Galois over P, G(P*:P) is a 0;-group 
are suitable integers. 


Proof. Follows from the considerations made in the proof of Theorem 2. 


THEOREM 4. Let the assumptions be as in Theorem 3 and let t=—1, 
1.e., let B have a simple point at P. Then P* ts simple, 


R* k[ [a2/™, Lo, tl], 
and x,h* is primary, 1.e., B does not split locally for the extension P—> P*. 


Proof. That P* is simple and R* =k[[2,”™, x2, -,2,-]] follows 
from Theorem 3. Since z, is the power of an irreducible element in &*, 
it must already be so in R*. 


vemark. Let G(P*:P) be the local Galois group over a ¢-fold normal 
crossing as referred to in Theorem 2. If p40, G(P*:P) can be very messy 
and insolvable even for t==2 and ¢=1 as will be shown by Examples 3 
and 4 in Section 3. Hence in a certain sense, Theorem 2 gives a best 
possible description of local Galois groups over normal crossings. Since 


since [K*: K] =0(p) if p ~ 0. 
*? Known in the classical case when k is the field of complex numbers; the known 
proofs being topological. 
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Theorem 4 essentially depends on the fact that for p—0 the local Galois 
group above a simple point of the branch locus is cyclic and since for p40 
such a local Galois group need not be cyclic, we cannot expect this theorem 
to hold for p40. Examples 1, 2 and 5 of Section 3 will show that for p0, 
a point above a simple point of the branch locus need not be simple. Example 
5 will also show that even if the branch locus B for P—P* has a simple 
point at P, still the image B* of B may split at P*. If we have a sequence 
P— P*,—> P*,—: -— P*, such that the branch locus B for P—P*, has 
a simple point at P, B may acquire a singularity at P*,, and hence a bad 
singularity at P*,, and a worse one at P*,; etc.; thus making the groups 
G(P*,;:P*;,) more and more complex. This “local splitting of a simple 
branch variety by itself” is the real reason behind the failure of Theorem 3 
and 4 for p0, and which in turn explains the failure for nonzero charac- 
teristic fields of Jung’s classical method of uniformization (see Section 4). 


3. Some examples. In the following examples we shall be dealing with 
a normal surface F in a projective space S; over algebraically closed ground 
field k of characteristic p340. F will be given by its defining homogeneous 
polynomial F in the homogeneous coordinates XY, Y, Z, T in 83. We shall 
project F from the point Y = Y —T—0 onto the (X,Y,T) plane. By K 
and K we shall denote the function fields of the (Y,Y,7) plane and of F 
respectively. By D we shall denote the branch curve of this projection on 
the (X,Y,7) plane. We shall omit some details in the proofs which consist 
of straightforward computations. For instance: since k is algebraically closed, 
F =0F/0Z =0F /0X =0F/0Y =0F/0T =0 will give the singular manifold 
of F (Theorem 7 of [12]). In Examples 1 to 5, F will have only a finite 
number of singular points, from which will follow the irreducibility of F 
and hence of #. Consequently in Examples 1 to 5, we shall have that the 
singularities of F are exactly the singularities of F and the finitenes of their 
number will in turn imply the (arithmetic) normality of # (Theorem 3 of 
[8]). We shall find D by computing the Z-discriminant of F. Also we 
shall use well-known facts from ramification theory of real discrete valuations, 
for which we refer to [3] and [5]. 


Example 1. 
Ze — — YP T if 
if p=2. 
I’: F =0 is a normal surface with XY = Y = Z =0 as the only singular point. 
AX = 0 is the equation of the branch curve D. Thus = Y —0 is a simple 
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point of D and above tt lies the singular point X =Y —Z=0 of FP. K/K 
is a cyclic extension of degree p. Consider the real discrete valuation vx of 
K/k given by the Y-axis: X =0, in the (X,Y)-plane. For p2, reducing 
F modulo vx we get: Z?—Y*+=0. This is an irreducible pure inseparable 
equation over the residue field k(Y) of vx. Hence vx has a unique extension 
dy to K. We have: i(@x: vx) =p and #(Gx:vx) =1. For p=2, writing 
Z* for Z+Y we get: F=Z**°4+XZ*+ XY. Again vx has a unique dx 
to K, but now #(ix: vx) =2 and vx) =1 since = $ux(X). 


Example 2. 


F = Z?— —XY?", 


F:F =O is a normal surface with X = Y —Z=—O0 as the only singuarity. 
X =0 is the equation of the branch curve D. Again X = Y —0 ts a simple 
point of D and above it lies the singular point X =Y =Z=0 of F. Again 
K/K is cyclic of degree p. Let vx be the valuation of K/k given by X —0 
and let 0x be an extension of vy to K. Then @x(Z) = (1/p)vx(X). Hence 
Dy is unique, 7(7x:vx) =p, and (Note that Examples 1 
and 2 coincide for p=2). 


Example 3. p=2. 
+ X7?°T? + YZT? 4+ XT. 


F:F =0 is a normal surface with X =~ Y—Z=—0 and as 
the only singular points. Components of the branch curve D through the 
point 9: Y are =0 and Y=—0. So Qs an ordinary double 
point of D. 

The point P:X = Y—Z=0 is the only point of F above Q. Let R 
and & be the completions of the local rings of P and Q respectively, and let 
E and E be the respective quotient fields of R and FZ. It is clear that 
and that 


F(Z) 


is the minimal polynomials of a primitive element of # over H. Let E* 
be the least normal extension of £ containing #, and let G be the Galois 
group of E* over £. 


We shall show that G is insolvable. 


Since */F is the root field of f(Z) = 0 we can consider G as a subgroup 
of the permutation group P; on five symbols. Let n be the order of G. 


= 
4 
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Consider the discrete valuation vy of & given by the prime element X of R. 
Reducing the equation f(Z) —0 modulo vy we get: 


Therefore there exists a valuation of H* above vy with inseparability index 
divisible by 4. 

Similarly reducing f(Z) 0 module the valuation vy_xe of EF given by 
Y —X° we get: 


4 — (7274 (284 X) =0. 


Therefore there exists a valuation of H* above vy_ys with residue degree 
divisible by 3. 

Hence n is divisible by 3, 4 and 5; i.e., m is divisible by 60. Since P, 
is of order 120 and since the alternating group A, is the only subgroup of P; 
of order 60; it follows that G is either A; or P;. In either case G is insolvable. 


Example 4. p=2. 
XZ*+ + XY’°T. 


Again F': F =0 is a normal surface with X = Y =Z=—0 and X = T —=Z=0 
as the only singular points. But now X —0 is the only component of D 
through the point @:XY¥ =Y=—Z=0. So Qs a simple point of D. Also 
P:X =Y =Z=0 is the only point of F above Q. Let R, R, FH, F, E* and 
G be as Example 3. Now YZ°4+ is the 
minimal polynomial of a primitive element of #/H. Again consider G as a 
subgroup of the permutation group P; on five symbols. 

Reducing f(Z) 0 module the valuation vy of H given by the prime 
element XY of R we get: 

=0. 


Therefore there exists a valuation v, of # above vy with inseparability index 
divisible by 2. Also we have that there is an extension v, of vx to F such 
that v.(z) > 0 where z is a root of f(Z) 0. It is clear that Yz* and XY? must 
be the minimum value terms in f(z) for v2, i.e., we have v.(Y2*) =v.(XY’). 
Since v.(Y) =vx(Y) and v,(X) vx(X) —1; it follows that v.(z) = 4, 
and hence we have that 7(v.:vx) 23. Now v;—v, implies that r(v,: vx) 
=>6>[#:E] which is a contradiction. Hence v,v,. Therefore 
+ F(ve:vxy) S5—[#:E] i.e, F(ve:vx) Therefore every 
extension v* of vx to E* is such that 1(v*: vx) is divisible by 2 and #(v*: vx) 
is divisible by 3. 


> 
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Therefore the order n of G is divisible by 2, 3, 5 and hence by 30. 
Hence G@ is either (i) Ps, or (ii) As, or (iii) a subgroup of P, of order 30. 
In cases (i) and (ii) @ is insolvable. We shall show that case (ili) cannot 
arise by proving that P; has no subgroup of order 30. Suppose then, if 
possible, that H is a subgroup of Ps of order 30. Let A =H,, Az, Hs, Hy, 
be the right cosets of H, and for we P, let 


(tu)(H,, He, Hs, Hs) = Hau, Hau, Huu). 


Then ¢ is a homomorphism of P; into the group of permutations of H,, Ha, 
H,, Hy. Since we H; implies Hw—H;, 4SSorder of ¢(P;) S24. Hence 
is a normal subgroup of P, of order m, where (120)/(24) =5 30 
=120/4. A contradiction since A, is the only normal subgroup of Ps. 
Therefore G is insolvable. 


Example 5. 
Zh 4 Yr + XP, 


The surface F: =0 is normal and has = Y=Z=—0 and X¥ =T=Z—=0 
as the only singular point. The point Q@:X —=Y —0 1s a simple point of the 
branch curve D:X =0. Again in the terminology of Example 3, 


is the minimal polynomial of a primitive element of H/H. Consider the 
Galois group G of H*/E, as a subgroup of the permutation group Pp), on 
the roots of f(Z) 0. Modulo the valuation vx of H given by the prime 
element in the equation f(Z) —0 becomes Y?") =0. Hence 
there exists a valuation of #* above vx with inseparability index divisible by p. 
Since the order n of G must then be divisible by p and since p is prime, 
G must contain a p-cycle. Thus G is a subgroup of P5,,, G contains a p-cycle, 
and the order of G@ is divisible by p(p-+1). For p=5b such a group G is 
very large and quite complicated. 

Observe that the point P=X—=—Y —Z=0 is above the simple point 
Q:X = Y =0 of the branch curve D: X =0; and still P is singular. D splits 
in passing from Q to P as is shown by the equation: Z(Z? + Y?") —0. 


Example 6. p=5. 
X. 


F: F =0 is nonsingular at finite distance. Q:X:Y —0 is an ordinary double 
point of the local branch curve XY =0. The only point of F above Q is 
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P:X=Y=Z=0. P isa simple point of F but still the local Galois group 
G of P over Q (notation of Example 3) ts insolvable. 


Modulo the valuation vy of H given by X, we have 
=0. 


Hence there exists a valuation v, of # over vx with separable residue degree 
over Vx divisible by 3. Also there is a valuation vz of # above vx for which 
v,(z) >0 where z is a root of F(Z) =0. Since for v2, Yz* and X are the 
minimum value terms of F(z); it follows that v.(z) = (4)ux(X) As 
in Example 4 we conclude that: v; Av2, vx) = 2, and g(vi: vx) =3. 
(One extension with g=—3 and one extension with 7—2 makes up 3-4-2 
= 5=—[2:F] and accounts for all the #-extensions of vx). Hence the order 
of G@ is divisible by 2K 3X 5=30. The insolvability of G now follows as 
in Example 4. 


Remark. In the above examples, let @ and P be the points X = Y —0 
and X =~ Y of the (X,Y,7) plane and of respectively. Observe 
that in all the examples: r(P:Q) =7(P:Q) =[K:K]=[L£:E£]. Now let 
K* be the least Galois extension of K containing K, and let P* be a point 
corresponding to P on a derived normal model F* of the (Y,Y,7) plane 
in K*. By Proposition 1, D is the branch curve also for the rational trans- 
formation from F* onto the (XY, Y,7) plane. In Example 3, Q is an ordinary 
double point of D and in Examples 4 and 5, Q is a simple point of D. In 
Examples 3 and 4, G(P*:Q) is insolvable. In Example 5, since D splits 
in passing from Q to P, D splits a fortiori in passing from Q to P*. In 
Examples 1 and 2, K = K* and hence P = P*. 


4, Jung’s method of uniformization. Let S be an algebraic surface 
in an n-dimensional projective space over an algebraically closed ground 
field k and let K be the function field of S. The classical problem of local 
uniformization on algebraic surfaces deals with the case when & is the field 
of complex numbers. It roughly asserts the following: Let P be a point on S. 
Then some complete neighborhood of P on S can be covered by a finite 
number of regions R,,R.,---,R; such that in each R; the coordinates 
1,X2,° * *, 2%» Of the points in R; can be expressed as convergent power series 
in two variables u, v; this representation being unramified in a certain sense. 
This problem was solved by Jung? by using a very fruitful method. Zariski 
generalized this problem when & is an abstract field, stating it in terms of 


valuation theory. If we use the concept of normal varieties due to Zariski, 
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then the basic ideas behind Jung’s proof become strikingly simple and are 
as described below. 

Let V be a normal model of the function field K/k in a projective space 
§, or dimension r. Project V onto a plane V’, from an 8,3 which does not 
meet V. Let K’/k be the function field of V’. Let 7’ be the rational trans- 
formation from V onto V’, thus defined. Since T and T-* are free from 
fundamental points, V is a derived normal model of V’ in K. By Theorem 2, 
the branch locus D on V is a curve. Let Q be a singular point of D. Apply 
to V’ a locally quadratic transformation ¢, with center Q, getting a new 
surface V’;. To Q there will correspond a fundamental line LZ, on V’;. 
We have that there is a rational transformation t7 of V onto V’, whose 
inverse is finitely valued at each point of V’;. Let V, denote a derived 
normal model of V’, in the function field K/k. Let JT, be the rational 
transformation of V, onto V’, (defined in a natural fashion). Then V;, V’; 
and 7’, are in the same relationship to each other as V, V’ and T. Also 
V, has no singular points. The branch curve D, of 7," is either the proper 
transform of D, or it is this transform together with the line Z,. After a 
finite number of steps it is possible to get V,, V’; and 7, such that the 
branch curve D, on V’, has only ordinary double points. We may assume 
that already V, V’, and T have this property, i.e., that D has only ordinary 
double points. 

Let Q be a point of V’ and let P be a point of V corresponding to Q. 
We have proved in Section 2 that: (a) If Q¢ D, then P is a simple point of V. 
(b) If Q is a simple point of D, then G(P/Q) is cyclic and again P is a 
simple point. (c) If Q is an ordinary double point of D, then G(P/Q) is 
the direct product of two cyclic groups.’? So it is only necessary to consider 
what happens “above” an ordinary double point Q of the branch curve D. 
Let P be a point of V above Q. The simple structure of G(P/Q) in this 
case enables Jung to resolve the singularity of V at @ in the classical case 
when k is the field of complex numbers, by topological considerations. 

Now supposing that k is of characteristic p40, we tried to generalize 
Jung’s method. But unfortunately in this case of modular ground fields as 
was shown in Section 3, the statements (b) and (c) are not valid and the 
groups mentioned there may even be insolvable. If these local Galois groups 
were at least solvable, that would have been of some use for the purposes of 
uniformization. 
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A NOTE ON TWO DIMENSIONAL DIVISION 
RING EXTENSIONS.* + 


By N. Jacosson. 


A division subring T of a division ring A is said to be Galois in A (and 
A is Galois over T) if T is the set of invariants (or fixed points) of a group 
of automorphisms acting in A. The two dimensional Galois extensions of a 
division ring have been determined by Dieudonné.? In this note we shall 
show that if T is a division ring of characteristic #2 which is finite 
dimensional over its center ¥ and A contains T and has left dimensionality 
[A:T ],—2 then A is Galois over fT. On the other hand, we shall construct 
a class of examples where [A:T], —2—[A:I]p, of characteristic 
and A is not Galois over IL. 


We prove first a general theorem on dimensionalities as follows. 


THEOREM 1. Let A be a division ring, T a division subring, ® and ¥ 
the centers of A and T respectively. (1) If [A:T],< and [T:¥] <o 
then [A:®] <oo. (2) If [A:®] <oo then [T:¥] S[A:@], the equality 
holding if and only if the division subring @(T) generated by ® and T 
coincides with A. 


Proof. (1) Our assumptions here imply that [A:¥]z;,<0oo. The result 
will therefore follow from the fact, which we proceed to prove, that if a 
division ring A is of finite left dimensionality over a subfield ¥ then A is of 


finite dimensionality over its center. Let 8,,8,- - -,8, be elements of A 
which are linearly independent over ©. Then it is well known that the right 
multiplications are linearly independent over the division 


ring Ay, of left multiplications. Hence 8;z are linearly independent over Wz 
the subfield of A; determined by ¥. On the other hand, the ring & of linear 
transformations of the left vector space A over © contains %, since W is 
commutative. Moreover, [@:¥,] = [A:W]z?. Since the this implies 
that n= Hence (2) If °°, 2, are 


* Received January 4, 1955. 
*This research was supported in part by a grant from the National Science 
Foundation. 
CE, also [1]. 
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elements of a ring then we define - => + where the 
P 


summation is over all the permutations (%,,%2,- - -,%,) of and 
the sign is + or — according as the permutation is even or odd. It has 
been shown by Amitsur and Levitzki ([2]) that if ®, is the ring of n Xn 
matrices over a field then [272° for all aje®, but there exist 
in such that [222° 40. This implies by the 
usual base field extension argument that if A is a division ring which is n? di- 
mensional * over its center then ==0 in A but Tons] 
in A. Also it is a known result of Kaplansky’s ([7]) that if [A:®] =o 
then A satisfies no polynomial identity with coefficients in ®. It is clear from 
these results that [A:&] <oo implies [[:¥] =[A:6]. Now consider 
This is a subspace of A over ®; hence it is finite dimensional. Let y1, y2,°- +, yr 
be a maximal set of elements of T which are linearly independent over ® and 
let V be the &-space spanned by these. Evidently TCV. Hence V is a 
subring of A. Since V is finite dimensional over @ it is a division subring. 
Thus V—®(T). Suppose then | for all 
choices of i;=1,2,---,r. Hence [2,22- *aom|==0 in =V so that 
if P is the center of ®(T), then [@(T):P]=m?. On the other hand, if 
==0 in then the same holds in T. It follows that 
Hence [T:¥] Now if [T:¥] 
= [A:6] then [@(T) : P] =n? and, since PD [6(T): 6] Thus 
®(T) —A. The converse is clear by the same argument. 


THEOREM 2. Let T be finite dimensional over its center WV and let A 
be a division ring containing T and such that [A:T],—2. Then if the 
characteristic is ~2, A is Galois over T. 


Proof. Theorem 1 shows that [A:®]<oo for ® the center of A. We 
distinguish two cases; J. TD. Let €(T) denote the centralizer of T in A. 
Then it is well known that €(€(T)) —Tf so that T is the set of invariants 
of the inner automorphisms determined by the non-zero elements of €(T). 
Il. Let Then and #?—y,t+y2, Since 
t and commute with every this gives (yyi)t + yyo= (yry)t + 


Hence yy; =vyiy and so that y; are in the center of Since A—T(t) 
where te it is clear that YIM. Hence the y,e. We may replace ¢ 
by w=t—Hy, and obtain For a, Bel, the mapping a+ fu 
— %— Bu is an automorphism in A whose set of invariants is T.* 

*It is well known that if A is finite over its center then its dimensionality over the 


center is a square. 
*I am indebted to the referee for the following remark: In part II of the above 
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In the remainder of this note we shall construct some examples of two 
dimensional non-Galois extensions. These will be given as certain Clifford 
algebras of infinite dimensionality. We shall therefore begin by indicating 
that the well-known results on finite dimensional Clifford algebras can be 
carried over to the infinite case. 

Let M be a vector space over a field ® and let Q(x) be a quadratic form 
on M. We denote the associated scalar product Q(x+y) —Q(x) —Q(y) 
by (z,y). This is symmetric and if the characteristic is two then (z,y) is 
alternate: (x,2) If N is a subspace we define = {y| (x, y) =0,reN}. 
N is isotropic if NN NLS {0}, totally isotropic if MCG N+. The scalar 
product (and the quadratic form) is non-degenerate if M1+ {0}, totally 
regular if there exists no isotropic ((#,7) =0) «0. 

Let % be the free algebra 1 M 
based on the space M and let 8 be the ideal in F generated by r®@x— Q(z)1. 
Then A= %/% is called the Clifford algebra of M relative to Q. 

It is easy to see that the natural homomorphism of UM into A is an 
isomorphism ([3], p. 39). One may therefore consider M as a subspace of A. 
If NV is a subspace of M, then the subalgebra of A generated by 1 and N is 
isomorphic, and so may be identified with the Clifford algebra A(N,Q) of NV 
relative to the contraction of Q.5 

From now on we assume that (z,y) is non-degenerate. Then it is well 
known that if M/ is even (finite) dimensional then A is central simple and if 
M is odd dimensional then A is a Kronecker product of a central simple 
algebra and a two dimensional semi-simple algebra. The structure of A can 
be described more precisely by using the following 


Lemma. Let F bea non-tsotropic subspace of 2n <0 dimensions. Then 
the Clifford algebra A(M,Q) of M relative to Q is a Kronecker product 
A(F,Q) @A(F1, (—1)4C#D8Q) of the Clifford algebra of F and Q and 
that of FL and (—1)#@#8Q where & is the discriminant of (x,y) in F. 


Proof. Char. #2. We choose an orthogonal basis (w1, 
for F and set (u;,u;) =8;1. Set c= - “Usy. Since ujuj = — 
for i Aj, ui? = 48,1, c? = (—1)#@“D8 where §=II8; is the discriminant of 
(x,y). It is known that for finite dimensional non-isotropic subspaces 


proof, no use is made of dimensionality relations. Hence the argument shows that if 
[A:T], = 2 for arbitrary T, and Ip 4, then A is the Kronecker product over ¥ of 
lr and a quadratic extension of ¥. | 

°It suffices to prove this for finite dimensional VN. The arguments given in [3] 
suffice in this case. If N is even dimensional and non-isotropic then the lemma given 
below can also be used. An easy reduction can be made to this special case. 


Pc] 


596 N. JACOBSON. 


F,M=FQ@FL. If and (cr)? It 
follows easily that the subalgebra generated by the elements cz is the Clifford 
algebra A(F4, (—1)#@#8Q) and 


A(M,Q) =A(F,Q) @A(FA, (—1)#C8Q). 
Char. 2. Here we can choose a symplectic basis for F’: 
(1, V1, U2, * Ups Vp) 


with (uj;,v;) —1—(v,u;) and all other products 0. The discriminant of 
(x,y) relative to this basis is 1. If xe F, ye F1 then zy=yz. It follows 
that A(M,Q) =A(F,Q) @A(FL,Q). 

This lemma shows that if M/ is of even dimensionality then A is a 
Kronecker product of generalized quaternion algebras and if M is odd 
dimensional then A is a Kronecker product of generalized quaternion algebras 
and a two dimensional semi-simple algebra. We now prove 


THEOREM 3. Let M be an infinite dimensional vector space with a non- 
degenerate quadratic form Q. Then the Clifford algebra A(M,Q) ts central 
simple. Moreover, if dim M =, then A(M,Q) ts a Kronecker product of a 
countable number of generalized quaternion algebras and any such product 
can be obtained from a suitable Clifford algebra. 


Proof. The first statement will follow by showing that any finite subset i. 


S of A=A(M,Q) is contained in a finite dimensional central simple sub- 
algebra containing 1. By using a basis one sees that S is contained in the 
subalgebra generated by 1 and a finite dimensional subspace F. Since any 
finite dimensional subspace can be imbedded in a non-isotropic one, we may 
assume F’ non-isotropic. Also it is easy to see (using M—F @ F1) that we 
may assume dim#’'=2y. Then the subalgebra A(F,Q) is central simple. 
Now suppose dim M—8&,. If char2 we denote the generalized quaternion 
algebra with basis (1,1,7,77) such that 7? =a1,7?=— 1, ij —=—ji by (4,8) 
and if char.—2 we denote the generalized quaternion algebra with basis 
(1,1,7,17) such that #?—al, j?— £1, y+ ji—1 by [@,B]. If char. 
we choose an orthogonal basis (w;, Us, +) with Q(u;) = 8; 
({8], p. 3). Then using (ap?, Bp?) = (a,8) for all p40 and the lemma, 
we obtain 


A= B:) (-— %1 — %18,B2) ® (—— %232%3, — BoBs) 
®-- -@ (— — 
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If char. = 2 we choose a symplectic basis (1, V1, U2, *) with (u,%) =1 
—(v;,ui) ({8], p. 4). If Q(ui) =a, Q(vi1) =f then the above lemma 


gives the formula 
A = [21,81] [ a>, Bo] [ a3, Bs] 
The last statement of the theorem is clear from the above formulas. 


It has been shown by Koethe ([9]) that there exist infinite Kronecker 
products of quaternion algebras which are division algebras. We shall sketch 
an alternative proof. Let T be a given division ring with center ¥. Then 
the polynomial domain ['[A] is a principal ideal domain and so it can be 
imbedded in a quotient field T(A). The center of is (A). If char. 2, 
we let S be the automorphism in T'(A) which is the identitly on I and sends 
\ into —A and we form the semi-linear polynomial domain T(A)[#, 8] of 
polynomials in ¢ with coefficients in T'(A) such that ta= (aS)t, ae P(A) 
({5], p. 29). This is a principal ideal domain and can be imbedded in a 
quotient field A. The center of A is ¥(A?,¢?) and A=T(A?, t?) (A?, 
where T'(A*, ¢?) is the division subring generated by IT, A?, ¢?. We can begin 
our construction with a field T=, and repeat this an infinite number of 
times to obtain a division algebra with center ®(A,?,t,?, A»”, t2?,- - -) which has 
the form (A,7, @ (A2”, - - over this center. If char. 2 we modify 
the construction by replacing the automorphism S by the derivation D in 
r(A) such that TD = {0} and AD=—1. Then we consider the differential 
polynomial domain T'(A)[¢,D] with ta=at+aD. In the end we obtain 
the center ®)(A,°, t,7, - -) and the division algebra 


[A1?, #12] ® @- - - 


over this center. 


We now suppose that A=A(M,Q) is a Clifford algebra of a space M 
relative to a quadratic form Q and that A is a division ring. Let H be a 
hyperplane in M, so that M =H @ 6u where we M and © is the base field. 
Let T= A(H,@Q) be the subalgebra generated by 1 and H. Since the elements 
of A are algebraic over ®, [I is a division ring. We have the relation 
ut = (x,u)1 for re H. Hence if acl then we have a relation of 
the form wa = (al)u-+-aD where al and aD are elements in I. Since u¢T 
the elements aI and aD are uniquely determined by a. One verifies that I is 
an isomorphism of I into itself and D is an J-derivation: (ab) D = (aI) (bD) 
+(aD)b. Since the image under J contains H it is clear that I is an 
automorphism. The relations wa—=(al)u+ (aD), u2=Q(u)1 show that 
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(1,u) is both a left basis and a right basis for A over I. Hence [A:T],=—2 
=[A:T]r. We now prove 


TuroreM 4. Let A=A(M,Q) be a Clifford division algebra determined 
by a vector space M relative to a quadratic form Q and let T=A(H,Q) be 
the subalgebra generated by 1 and a hyperplane H. Assume char. A2. Then 
A is Galois over T if and ony if Ht S+ {0}. 


Proof. Since A is a division ring, M contains no isotropic vectors. 
Hence HN HL {0}. If H+ {0} we choose wu in this space. Then we have 
ua = (al)u and we have an automorphism mapping every aeT into itself and 
mapping u into —u. The set of invariants relatively to this automorphism is 
Tr. Hence T is Galois in A. Conversely, assume A is Galois over T. Then it is 
known that A—TI(t) where ¢ is an element such that the inner automorphism 
determined by ¢ maps T into itself. Write t=cu+d,c,deT, c~0. We 
may replace t by If aeQT the condition va—a,1, 
a,eT gives a,—al and aD=(al)e—ea. In particular, if reH, then 
(z,u)1=—(ae+er). If M is finite dimensional then it is clear that 
H14S~ {0}. Hence we may assume MV infinite dimensional. Then we can 
find an even dimensional subspace F' in H such that ee A(F, Q) the subalgebra 
generated by 1 and F. Since Q is non-degenerate in F we can find a wu’ e F such 
that (x, u’) = (2, u) for all re F. Hence if v’ =u’ + e then ev’ + =0, re F. 


If is an orthogonal basis for F, the element 
has an inverse and anti-commutes with every ze F. Since A(F,Q) is central 
this implies that v’ =pu,v,- - -u,v,. We now imbed F in a space G in H 
of (24-+ 2)-dimensions with orthogonal basis (;,01,° *, Since 
(Uysis U) 1 — Hence p=0 and e=—uw’eH. Then 


(x,u) = (2,u’) for all ce H and u—w’ +0 is in H1, Thus Hl {0}. 

If VW is an &, dimensional space over a field of characteristic #2 and 
Q is a quadratic form whose associated scalar product is non-degenerate, then 
we can always find hyperplanes H in M such that Hi = {0}. This is easily 
seen by using an orthogonal basis. This remark and Theorem 4 show that 
there exist two dimensional non-Galois extensions of division rings of 
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ON THE ORTHOGONALIZATION OF OPERATOR 
REPRESENTATIONS.* 


By RicHarp V. KapIson. 


1. Introduction. The main problem with which we shall be concerned 
is that of finding conditions under which a group representation is similar 
to a unitary representation and conditions for a representation of a self- 
adjoint algebra of operators on a Hilbert space to be similar to an adjoint 
preserving representation (* representation). These situations are slightly 
different aspects of the representation orthogonalization process with very close 
interconnections. With regard to the algebra situation one can phrase the 
main question in the following way. Is an algebra of operators on a Hilbert 
space which ts the isomorphic image of a C*-algebra (uniformly closed self- 
adjoint algebra of operators) similar to a C*-algebra in such a way that the 
composition of the isomorphism and the similarity 1s an adjoint preserving 
representation of the C*-algebra? The question for group representations 
takes the following form. Js every bounded representation of a group by 
operators on a Hilbert space similar to a unitary representation, where, by 
“bounded representation” we mean that there exists a constant such that 
each representing operator is, in norm, less than this constant? In this form, 
the group question has been raised before, notably in [1], [2], [5]. The 
question of when a group admits a mean (or Banach Limit) is the primary 
consideration of these papers and then, employing the technique of [4], it is 
shown that for such groups all bounded representations (continuous in the 
strong topology) are similar to unitary representations (this is done in [4] 
for the infinite cyclic group and the reals). We outline this technique. A 
(left, right) mean on a group G is a positive linear functional on the linear 
space B(G) of bounded continuous functions on G which takes the value 1 
at the constant function 1 and which is invariant under (left, right) transla- 
tions on the group. If G admits a right mean m and g—A, is a strongly 
continuous representation of the group by operators on a Hilbert space, then, 
for each z, y in the Hilbert space g > (A,z, A,y) is a bounded continuous 
function on G and <2, y) = m((A,z, A,y)) is an inner product on the Hilbert 


* Received February 17, 1954: revised February 1, 1955. 
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space giving rise to a norm equivalent to the original norm and under which 
the operators A, are unitary. It is then standard to find the similarity of 
the original Hilbert space which takes each A, into a unitary operator. 
Continuity considerations do not enter into the question of whether all 
bounded representations of all groups are similar to unitary representations 
since it is obviously sufficient to settle this for discrete groups. With regard 
to the technique of means, it is well-known that many groups admit neither 
a left nor right mean (in the sense noted above, e.g., the free group on two 
generators; see [1], [2]) so that this method cannot, in itself, give the full 
answer. 

Our concern is not with conditions on the group which imply that 
bounded representations are similar to unitary representations but rather 
with restrictions upon the representations which insure that they are similar 
to unitary representations. We feel that this approach gives hope of settling 
the full question one way or the other. 

The truth of the operator algebra proposition is trivially implied by the 
truth of the group statement (see the proof of Theorem 7). On the other 
hand, a bounded representation of a group (discrete) can be extended to a 
representation of its LZ, algebra (as a self-adjoint algebra of operators acting 
by convolution on L.). This algebra of operators need not be closed in the 
uniform operator topology and the representation need not be extendable to 
the uniform closure of this algebra; so that it would appear that the truth 
of the algebra result would imply the group conjecture only for those bounded 
representations which are extendable to the uniform closure of the L, algebra. 
(The similarity which transforms the representation of the L, algebra into 
a * representation will transform the group representation into a unitary 
representation). However, it is possible to renorm the ZL, algebra in such a 
way that the completion of the resulting * algebra is a C*-algebra to which 
each bounded group representation is extendable, by assigning to each element 
of the L, algebra the supremum of the norms of its images in each * repre- 
sentation. It follows from the existence of a Banach algebra norm on L, 
in which the * map is isometric (viz., the L, norm) that this supremum is 
not greater than the Z, norm (and certainly finite). Our extended group 
representation, being a bounded algebra representation relative to the Z, norm 
on the LZ, algebra of the group, is a continuous representation of the L, algebra 
in the C* norm just constructed. Although the representation extended to 
this C*-algebra may not be an isomorphism, the kernel is a closed two-sided 
ideal so that the factor algebra is a @*-algebra [8], and the induced repre- 
sentation on this factor C*-algebra is an isomorphism. Thus we have the 


7 
| 
| 
] 


602 RICHARD V. KADISON. 


complete equivalence of the group and C’*-algebra questions. We are indebted 
to I. Kaplansky for bringing to our attention the known renorming device 
used above. It is possible that a more incisive operator algebra result would 
apply directly to the LZ, algebra (acting by convolution on L.). In [6], 
Mackey proves the algebra result in the commutative case by direct methods 
(this result follows at once, as in the proof of Theorem 7, from the fact that 
commutative groups have means, in the sense defined above; see [5]). 

The origin of the group question can be found in the classical statement 
which says that each representation of a finite group by (complex, real) 
matrices is equivalent to a representation by (unitary, orthogonal) matrices 
and its extension to continuous representations of compact groups [7]. The 
technique used in these proofs, invariant integration over the group, is almost 
identical with the technique of means. Using this theorem for compact 
groups the operator algebra result follows for finite-dimensional operator 
algebras (applied to the (compact) group of unitary operators in the algebra). 
Perhaps a more natural way of concluding the algebra result in the finite- 
dimensional case is thru the semi-simplicity of the image algebra. In this 
ease the various concepts of semi-simplicity coincide so that the semi-sim- 
plicity of the original C*-algebra, interpreted algebraically, is inherited by 
the image, and this, interpreted spatially, shows that this image is similar to 
a C*-algebra. In the infinite-dimensional case it is not at all difficult to 
construct an algebra of operators which is semi-simple in all the conven- 
tional senses but not similar to a C*-algebra. The following topological 
difficulty can occur: while each invariant subspace may have a complementary 
invariant subspace, the greatest of the angles between the given space and all 
possible invariant complements may tend to 0 for some sequence of invariant 
subspaces, thus ruling out what we may call the “topological semi-simplicity ” 
of the algebra. For a C*-algebra and a unitary group, the orthogonal com- 
plement of an invariant subspace is invariant. Our similarity problem for 
the given family of operators (group or algebra) amounts to an _ ortho- 
gonalization process. 

In Section 2, we begin by defining concepts of local semi-simplicity and 
bounded local semi-simplicity of group representations. Theorem 1 states 
that bounded local semi-simplicity of a group representation is necessary and 
sufficient for the representation to be similar to a unitary representation. 
Several different forms of a conjecture concerning the group question are 
discussed, with the aid of Theorem 1. To study the operator algebra question, 
we develop a device for measuring the deviation of a set of vectors from being 
an orthonormal set. After stating a condition for a representation of a (*- 
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algebra to be similar to a * representation in terms of the group condition 
(Theorem 1), we employ this device to give a more delicate criterion for 
topological semi-simplicity of an algebra of operators. In the concluding 
section, we discuss some extensions of the results stated, examples, and a 
class of natural questions an affirmative answer to any of which would yield 
the fact that all bounded operators on a Hilbert space have non-trivial, closed, 


invariant subspaces. 


2. Conditions for topological semi-simplicity. The following defini- 
tion contains a description of local behavior of a group representation, which 
is necessary and sufficient for the group representation to be similar to a 
unitary representation. The statement and proof of this fact are contained 
in Theorem 1. 


DEFINITION 1. <A representation g— A, of the group G by bounded 
operators Ay on the Banach space B is said to be “locally semi-simple” 
when, for each finite set of vectors in and °,9n of 
elements in G, one can find a linear transformation S defined on the finite- 
dimensional vector space generated by +, 2n, Such 
that || Sx; || = || SAg,a; ||; t—1,---,n. If there exists a constant M such 


that S can always be chosen satisfying 


1/M = inf {|| Sx ||: re || z | —1}; 


M = sup {|| 8z ||: | —1}, 
we say that the representation is “ boundedly locally semi-simple.” 


THEOREM 1. A representation g—A, of the group G by bounded 
operators Ag on a Hilbert space # is boundedly locally semi-simple if and 
only if it is similar to a unitary representation. 


Proof. The necessity of the condition is quite easy. Indeed, suppose 
that S is a bounded invertible operator on # such that SA,S-! is unitary 
for each g in G. Let in and in G be given. Since 


SA,S-? is unitary we have 


| SA, === | SA,,S 187; I = OX; 


which establishes the bounded local semi-simplicity of the given representation. 

Suppose now that the given group representation is known to be 
boundedly locally semi-simple and that M is a bounding constant. We 
establish, in the succeeding lemmas, the existence of an invertible operator 
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P with | P|, || P-* || not exceeding M and such that P-*A,P is unitary for 
each g in G. 


Lemma 2. If U ts a finite-dimensional subspace of & there exists a 
Hilbert space norm || |\’ on © such that || Aga |\’=|| Aya ||’ whenever A,r 
and Ayx are in VU and such that 1/M ||y||S||y |’ SM || y || for each vector 
y in UV. 

Proof. We endow the conjugate tensor product U® WU (i.e., the tensor 
product which is conjugate linear in the second variable, ay a(x@y)) 
with the natural inner product derived from the inner product of &, i.e. 
we define <r@ y,z@w> = (2x,z)(w,y) and extend the domain of definition 
of this inner product to all of US by bilinearity. It is well-known that 
this process gives rise to an inner product (independent) of the representation 
of the elements involved as a sum of elements of the form x @ y—(see [3], 
for example). 

Let U® be the subspace of U®WU generated by tensors of the form 
z®za, and let €@ be the subspace of W@ generated by vectors of the form 
A,y® Ajy—y@y, where y and A,gy are in VU. Choose a basis 


Agy: Agyi —Y1 @ AgmYm AgnYm— Ym Ym 


for €@. By the bounded local semi-simplicity of the representation g— A,, 
we can find a linear transformation S such that || SA,,y; || = || Sy ||; 
+=1,:--,m and 
1/M S inf {|| Sy ||:y in | =1}; 
M= sup {|| Sy ||:y in VU’, || y | = 1}, 


where ©’ is the subspace of generated by Ym, AgmYm 
By defining S to be a suitable scalar (between 1/M and M) on UCU, 
we obtain a linear transformation, which we denote again by S, defined 
on all of © and satisfying the same conditions as above with U replacing 
%’ (no difficulties can arise if we choose S so that S(U’) =U’ by con- 
posing the original S with a unitary transformation). 


Let z= > S*2;®@ S*z;, where is an orthonormal basis for 
S(U). Observe that 


n 


Ag. Ag yi— @ Yi) = (S* aj, (Ag yi, S*2;) 


j=l 


— (S*2j, yi) (yi, S*xj) = | SAg.yi) |? — | (a, Sys) |* 
j=1 


= || SAg.ys ||? — |] Sys |i? =0, 


| 
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as follows from the fact that SA,,yi, Sy; are in S(U), the Parseval equality, 
and the choice of S. Consequently z is orthogonal to €@. We define a 
new inner-product on © by means of (2,y)’—=<xr®@y,z> so that the new 
Hilbert norm on W% satisfies 


ly |’ >> (S*2;, y) (y, 


| (xj, Sy) = || Sy |. 


j=1 


It follows at once from this last equality that 1/M|y||S|y|’SM\|y |. 
If now y and A,y are in U then A,y® A,y— y @y is in €®@ so that 


0= —y@y,z) = (Agy — (y@y,z) 
= (| II")? — II)? 


or || Agy |’, If A,r and are in VU, then 
|| Aga ||’ = |! Age |’, which completes the proof of this lemma. 

The following lemma allows us to pass from our finite-dimensional 
information to information about the full space on which the A, operate. 


Lemma 3. Jf @ is a Banach space and p is a partition function on B 
such that on each finite-dimensional subspace 8, of B one can introduce a 
norm || ||" in which ®, is a Hilbert space, each partition class intersected 
with B, lies on the shell of some sphere center at 0 in the norm || ||’, and 
there exists a constant M (depending upon p) such that 


1/M 


for each x in B, (where || || is the norm on B); then the underlying vector 
space of B admits a norm, equivalent to the original norm, in which it is a 
Hilbert space and such that the partition classes of p each lie on the shell 
of some sphere center at 0 relative to the new norm. 


Proof. We form a product of intervals with @ as the indexing family. 
To each point x in 8, we make correspond the closed interval [|| 2||/M, M || z||] 
(thus to 0 in @ we make correspond the number 0). Denote by X the 
Cartesian product 


Il II [|| |/M, || 
zeB 


We consider X in its standard product topology, in which it is compact, where 
each /, is given its usual metric topology. Let @, be a finite-dimensional 
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subspace of @, and let ¥(@,) be the set of points of XY which as functions 
restricted to @, give rise to a Hilbert space norm on @, which is constant 
on the partition classes of p intersected with @,. We shall show presently 
that X(@,) is a closed subset of Y. Assume, for the moment, that we have 
proved this fact. The sets X(@,) have the finite intersection property 
(®, ranging over the finite-dimensional subspaces of @). Indeed, let 
®.,--°,B, be a (finite) set of finite-dimensional subspaces of @ and let 
8, be the (finite-dimensional) subspace they generate. By assumption, we 
can find a Hilbert space norm || ||, on @, which is constant on the partition 
classes of p intersected with @,, and which satisfies the inequality 


1/M | x | | ilo = M | 


| || 

for each z in @,. The function which assigns to each z not in @, the value 

M \\ x || and to each z in @, the value || ||, lies in which is clearly 
n 

contained in {| X(@;). It now follows from the compactness of XY (and 
i=l 

our assumption that the sets X(@,) are closed in X) that the intersection 

of all the sets ¥(@,) is not empty. Let || ||’ be a function on @ in this 

intersection. Then, on each finite-dimensional subspace of @, || ||’ induces 

a Hilbert space norm. It is immediate that || ||’ satisfies the norm axioms 

and the Parallelogram Law on @ as well as being constant on the partition 

classes of p, so that || ||’ is our desired Hilbert space norm on B. Of course 

|e 

the sets X(#@,) are closed in X. We shall omit this proof, however, since 


| |’ is in X. It remains to prove that 


, since 


it is a standard approximation argument of the type employed in the proof 
of the w*-compactness of the unit sphere in the conjugate space of a normed 


linear space. 


Proof of Theorem 1. As partition function p on 9 we take the map 
which assigns to each vector z in & the set of vectors {A,r:g in G}. Since 
the family of operators {A,} forms a group, this map defines a partition 
function on &. Lemma 2 establishes the hypothesis of Lemma 3 with this 
partition function and & for @, so that we can conclude the existence of a 


norm | ||’ on & in which & is a Hilbert space and such that || A,,z |!’ 
= || Ay,v ||’ for each z in & and g:,g2 in G. In particular || x ||’ = || A,z ||’ so 
that each operator A, is isometric with respect to the norm || |\’. Moreover, 


| |’ can be so chosen that || z ||/MS || x |’ = M || for each vector x in 


Let z,,- - - be an orthonormal basis for &# with respect to the norm || | 


| ( 
1 
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(and associated inner product ( , )), and let y,,- - - be an orthonormal basis 
for 94 with respect to the norm || ||’ (and associated inner product ( , )’). 
Define a linear transformation P of & into itself by Pr;x=y;; 1—1,- 
Then 
= (2 (2, i) Y, Xi) = (2, zi) (Zi. y) 


= (2 (2, Ys i) yi)’ = (Pa, Py)’. 


Of course (P-tx, P-ty) = (2, y)’, substituting Pax for « and for y 
throughout the above equality. We assert that P-*A,P is a unitary operator 
on &# with respect to the norm || || for each g in G. Indeed, 


(P*A,Px, P*AgPy) = (AgPx, AgPy)’ = (Pz, Py)’ = (2,9). 


We note in conclusion that || P ||, || P-* || do not exceed VM. In fact, if c= > a2; 
with is given, then || Say;|’—1 and 
|| Pr|’S=M| Pr, so that 1/MS| Pr| SM. 
There are several ways of formulating a conjecture concerning the classical 
question of whether or not each bounded representation of a group is similar 
to a unitary representation. 


CongecturE A. Every bounded representation of a group by operators 
on a Hilbert space is similar to a unitary representation. 


ConJgecture B. There exists a function f from the positive reals to the 
positive reals with the property that for each bounded representation g— A, 
with bound M, of a group G by operators on a Hilbert space one can find an 
invertible operator P such that P-\A,P is unitary for each g in G and such 
that || P || || do not exceed f(M). 


ConJEcCTURE C. Same as B with f as the identity transform. 


Each of the above conjectures is clearly stronger than the preceding one. 
We shall show that B is actually equivalent to A in the next lemma. 


Lemma 4. Conjecture A is equivalent to Conjecture B. 


Proof. Clearly B implies A. Suppose now that A is true. If B is false 
there exists a sequence of groups G,, G2,- - - and a sequence of representations 
A,o,: of these groups, respectively, each with bound 
M and such that if N;—inf{max(| P;||, | ||): unitary for 
each g() in G;} then lim Nj=o. Let G=G,®@G.®@- - - be the weak direct 


i 


4 
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sum of the groups G,, G2,- - -, and let g—» Ay be the direct sum of the repre- 
sentations A,w,- - -. The representation g—> A, of G has bound 
Assuming A, we can find an operator P such that P-*A,P is unitary for each 
gin G. Restricted to each direct summand, this similarity induces similarities 
of all the representations g“)-—» A,w, each similarity with bound not greater 


than max(|| P ||, || P-* ||)—a contradiction. Hence A implies B. 


THEOREM 5. Jf B is true for the free groups on finitely many generators 
then B is true for all groups. 


Proof. Let g—A, be a representation of G with bound M. We shall 
show that this representation is boundedly locally semi-simple with bounding 
constant f(M). In fact, let 2,,- - -,2, in and - in G be given. 
The group G, generated by g:,° - -,9n is the homomorphic image of F,, the 
free group on n generators. Thus the representation g—> A, of G restricted 
to G, gives rise to a representation of F,, with bound M which, by hypothesis, 
is similar to a unitary representation via an operator P with || P ||, || P-*|| 
not exceeding f(J/). As in the proof of Theorem 1, we now conclude that 
|| PAg,a; || = || Pz ||, -,n; so that the representation is boundedly 
locally semi-simple with bounding constant f(M). Hence, by Theorem 1, 
the representation g— A, is similar to a unitary represeniation via a 7’ such 
that || 7 ||, || Z-* || do not exceed f(/). Thus B follows for all groups. 
Note that the proof of Lemma 4 shows that assuming A for the class of 
groups generated by no more than a countable number of elements implies 
B for this class (since the group G constructed in the proof would be in this 
class). Now every group in this class is the homomorphic image of the free 


group on countably many generators, F,, so that assuming A for F,, implies 
A for all groups with a countable number of generators and hence B for the 
free groups on finitely many generators. With the theorem just proved, this 
yields: 


Coro~uary 6. If A holds for the free group on a countable infinity of 
generators then A and hence B holds for all groups. 


We turn our attention now to the question of topological semi-simplicity 
of algebras of operators. In Theorem 8, we state a necessary and sufficient 
condition for a representation of a C*-algebra to be similar to a * represen- 
tation. Before stating this result, however, it is necessary to introduce some 
geometrical concepts. In particular we must associate to each configuration 
of vectors an object which measures its deviation from being an orthonormal 
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set. To this end, we introduce an “inner product” between two sets of n 
vectors in #. This inner product has as its range of values, operators on #. 


DEFINITION 2. If (Y1,° Yn) are two n-tuples 
of vectors in & with U, W the spaces generated by Yn}, 
respectively, we denote by <z,%> the operator on & defined as follows. Let 
0 be the space of n-tuples of complex numbers with the usual inner product 
and let ¢,,- be the basis (1,0,- - -,0),- +, (0,- + -,0,1). Let P be 
the map of C" into determined by =2;, 1=1,- - -,n, and let Q be 
the map of C” into W determined by Q(e:) =yi. By Q* we mean the adjoint 
map to Q, from into (characterized by (Q*x,a) = (x, Qa), where x is 
an arbitrary vector in #, a in C*, the first inner product is taken in C", and 
the second in #). Then = PQ*. 


We note some of the properties of < , >. As a function on the product 
of &, (the n-fold direct sum of & with itself) with #,, this inner product 
is conjugate bilinear. Indeed with -,%n), = 
= PQ’* we have 


= (P + P’)(aQ)* = a(PQ* + P'Q*) = + <2, 9) 
and similarly 
With the notation as in the definition, we see that the range of <z,%> is 
contained © and that the range of <z,9>*—=<9,2> —=—QP* is contained 
in W%?. Since the null space of an operator is the complement of the range 
of its adjoint, we have that the complement of W% in & is the null space of 
<Z,9>. Thus, effectively, is a transformation from W to VU. 

We compute the transformation <Z,% > precisely. With the notation 
above we have, for z in #: 


(Q*s, ei) (z, Qei) (z, Yi), 


so that (z,y:)e;, hence PQ*2 (2, Thus <z,%> can be 


expressed symbolically as ( ,y,)ai:it-°--:+( ,Yn)@n. It is immediate from 
this, that <7, > is a positive operator on % (as it is from the expression PP* 
for <z,>), and as such has a (unique) positive square root. We denote 
this square root by <Z> and refer to it as “the geometrical norm of % (or of 
the configuration z-,- - -,z,).” The fact that ¢z> is the identity operator 
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on the n-dimensional space ©U (so that x;,- - -,z, are linearly independent, 


in particular) is equivalent to 


being the identity transformation on %U, which is equivalent to 2,,- - -,2, 
being an orthonormal frame. The spread of the spectrum of <x), in general, 
is a measure of how much 2,- - -,2, deviates from being a scalar multiple 
of an orthonormal set. In the one-dimensional case, i.e., with x,y vectors 
in #, we have <x,y>=( .y)z. If we restrict this operator to the one- 
dimensional space generated by z, it becomes multiplication by (2,y), the 
usual inner product of z and y. 

Suppose, now, that 2 is a C*-algebra and ¢ is a representation (not 
necessarily * preserving) of % by operators on a Hilbert space #. Employing 
Theorem 1, we obtain the following criterion for ¢ to be similar to a * 
representation. 


THEOREM 7. If U is the unitary group of the C*-algebra A, a necessary 
and sufficient condition for a representation & of X by operators on a Hilbert 
space & to be similar to a * representation of % is that restricted to U be 


a boundedly, locally semi-simple group representation of U. 


Proof. If ¢ is similar to a * representation there exists an operator P 
on &# such that P-4¢(U)P is unitary, for each U in U. Thus ¢ restricted 
to U is similar to a unitary representation of U; and ¢ is boundedly, locally 
semi-simple, by Theorem 1. On the other hand, if ¢ is boundedly, locally 
semi-simple as a representation of % then, by Theorem 1, there exists an 
operator P on & such that P-*(U)P is unitary for each unitary operator U 
in Y%. It now follows that A—+P¢(A)P is a * representation of Y. 
Indeed, the given map is an algebraic isomorphism of %. Suppose A is a 
self-adjoint operator in 2% of norm not exceeding 1. Then A~4(U,+U.) 
where U, A and U, =A —i(I — are unitary operators 
in Thus P-*¢(A)P =3[P4(U,)P + P*¢(U2)P] with P-*6(U,)P and 
P-*¢(U.)P unitary, so that 


[P7¢(U.)P]* = P]* = P= Pg (U*)P; i= 1,2. 
Thus 

[P*¢(A)P]* = + U.*))P =P 19 (A*)P = 
so that A—P-*¢(A)P takes self-adjoint operators in % into self-adjoint 


operators, and, therefore, is a * representation of QM. 
Making use of the foregoing concept of inner product between sets of 
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n vectors, it is possible to give a more delicate analysis to the question of 


which representations of C*-algebras are similar to * representations. If ¢ 
is a representation of a C*-algebra 2 by operators on a Hilbert space 3’ 
(9 acts on #) and $(A)a’—0 for some unit vector 2 in #’, then for 
each positive « one can find a unit vector « in & such that || Ar|| <e. 
Indeed, if @(A;)a’=0, 1=1,- + -,n one can choose the unit vector z so 
that || Aja || - -,n, i.e., the relations ¢(A;)2’ = 0 can be “ approxi- 
mately duplicated” with 2 and & via ¢. In fact, the set of operators A 
such that #(4)v’=0 forms a proper left ideal & in YM (proper, since 
¢(1) =I). If for each unit vector « in & one of A;x has norm not less 
than « then T= A;*A;=cJ. But T is then invertible and in & This 
contradiction implies the existence of the desired unit vector x. Given « > 0, 


n 

vectors in such that || ||? = 1, and relations = 0, 
i=1 

h=1,- -,m; it is even possible to find vectors in with 


n 
> || ||? —1 such that || S || h=1,---,m. This can be done by 


working with the n * n matrix algebras over % and $(%) as we did above 
with 2% and (2) themselves. On the other hand, suppose the relations 


n 

¢(Ani) =0, h=1,- subsist with 2,’,- - -,2,’ an orthonormal set 

i=1 

in #’; is it possible to choose x,,: * -,., an orthonormal set in 9% such that 
n 

| SAniti || <<e, h=1,---,m? This is not necessarily possible on two 
i=1 


grounds; a multiplicity consideration, or more simply, the dimension of # 
may not be large enough to accommodate an orthonormal set with n vectors, 
secondly, it is too much to ask for orthonormality of 2,,- - -,2x, in light of 
the fact that ¢ may not be a 
possible to choose 2,,° * *,%, an orthonormal set then ¢ is already a * repre- 


* representation (Theorem 8 shows that if it is 


sentation). The multiplicity question can be avoided by asking whether or not 


n 
a * representation of can be found (once the relations ¢(Aj:) = 0 
i=1 


and « > 0 are given) such that || Sy(An:)a || <«. As for the orthonormality 
4=1 


question, can we at least find bounds, dependent upon the representation ¢ 
alone, for the distortion of z,,- - -,2, from being an orthonormal set? The 
technique for measuring this distortion has just been developed. It is not 
difficult to see that if @ is similar to a * representation y via an operator P, 
then y will serve for the exact duplication of all relations with the distortion 
bounded by max(|| P |, || P-* ||) (this will be done in detail in the necessity 
portion of Theorem 8). These considerations lead us to: 
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DeFINITION 3. Let be a representation of the C*-algebra YU by 
operators on the Hilbert space and let = be an n-tuple 
consisting of vectors 2,,° * *,%n which form an orthonormal set in &# such 
that (¢(Aij))Z=0, where ($(Aij)) is an n Xn matric whose entries are 
operators in o(M). Let w be a * representation of M by operators on a 
Hilbert space 9’, and let = - be an n-tuple of vectors in 
such that || (W(Aij)z || <<, where « is some positive number and the spectrum 
of <z’>, as an operator on the space generated by 2,’,- : -,2'n, 18s contained 
in the interval [k,K]. We say, then, that “ || (W(Aij) )& || << ts a self-adjoint 
« cover of the relations (¢(Ai;))Z=0 with distortion in [k,K].” If there 
exist constants k, K, (K>k>0) such that each relation of the form 
($(Ai;) )2=0, with Z as above and (Ay) a positive operator (in the C*- 
algebra consisting of n X n matrices over M), has, for each positive e, a self- 
adjoint « cover with distortion in [k,K], we say that “the representation $ 
has a self-adjoint cover (with distortion in [k, K]).” 


We have not made the definition of a representation having a self-adjoint 
cover as restrictive as we might, in that we require only relations coming from 
positive m Xn matrices to have self-adjoint « covers. This is all that is 
needed for each relation to have a cover. It might seem more natural to 
use the phrase “¢@ has a self-adjoint cover” to mean that for each e there 
is a self-adjoint representation which serves as a self-adjoint « cover of ¢ for 
all relations. That this actually follows from the weaker condition used and, 
indeed, that there is a self-adjoint representation which works for all positive 
e and all relations is the substance of: 


THEOREM 8. A necessary and sufficient condition for a representation ¢ 
of a C*-algebra XM by operators on a Hilbert space & to be similar to a * 
representation is that @ have a self-adjoint cover. If the distortion is in 
[k,K] then a similarity can be effected by a positive operator with spectrum 
in [k, K]. 


Proof. The necessity presents little difficulty. Suppose that there exists 
an invertible operator T on # such that A—>T7¢(A)T™ is a * representation 
of and let M—max(|| 7, || 7-*||). If (¢(Aij))Z—0 is some relation, 
with z= (%,° -,2n), *,2, an orthonormal set, then = 0 
is a self-adjoint cover for this relation (all > 0), where = (Tx,,:--, 
and where the distortion lies in [1/M,M]. Indeed, that (7'(Aj;)T-)z’ =0 
with the given z’ is immediate. Let P be the linear transformation from (” 


int 


on 


Th 


inf} 


so t 


Let 


» the 


and 


with 


612 

Th 
tha 
self 
In 
len; 
pro 
( 
we 
spa 
ope 
for 
VU 
in t 
of t 
a be 
E® 
|_| 
| | 

| 
| 


é OPERATOR REPRESENTATIONS. 613 
into defined by Pe; (see Definition 2), and let be the projection 
on the space generated by 2,° @n. Then 

= TPP*T* = TET* = (TE) (TE)*. 

Thus || <2’, 2’> || = || TE ||? S| T |? Moreover 


2,2): || in TEM} =inf{|| (TEL) 


=inf{| ET*Ty in E9, | Ty | =1) inf{(T*Ty, 
=inf{| Ty |*/|y inf(1/|y in BS, | Ty 


: Thus the spectrum of <2’, 2’) as an operator on TH lies in [1/M?, M*] so 


that the spectrum of ¢Z’> lies in [1/M,M], and (T¢(Ai)T*)z =0 is a 
self-adjoint « cover (alle > 0) for (¢(Ai;) with distortion in [1/M, M]. 
In connection with foregoing inequalities, note that y is in H& so that the 
length of the projection of T*Ty upon L& is not less than the length of the 
projection of T7*7'y upon the subspace generated by y (this length being 

Suppose now that the map ¢ has a self-adjoint cover. As in Theorem 1, 


we show that each finite-dimensional subspace ©U of & admits a Hilbert 
space norm || ||’ such that || ¢6(U)z ||’= || ¢(V)a ||’ when U, V are unitary 


operators in with ¢(U)z, in VU, and such thatk || y | S| y|’SK|y] 
for each y in ©. Following Theorem 1, form the conjugate tensor product 
U®WU of U with itself and endow it with the unitary structure described 
in that theorem. Let U® be the subspace of U@WU generated by tensors 
of the form x ®@ zx and € ® the subspace generated by elements ¢(U)x#®o(U)x 
—zr@z, where U is a unitary operator in & and z,¢(U)z are in VU. Choose 
a basis (U1) @ y:— +, 6(Um) ym 6(U m) Ym — Ym @ Ym for 


€@ and an orthonormal basis z,,- - -,2, for U. We have 
$(Ui) yi => Bz; and Yi 1,- 
j=1 j=1 
so that 


j=1 j=l 


Let a positive integer r and a positive number 8 be given. We wish to establish 


; the existence of a * representation y of 9 as operators on a Hilbert space 9’ 


and vectors 2,’,- - -,2,’ in 9’ such that has its spectrum in [k, K], 
with = (2,’,- - -,2p,’), and such that 
| Bgl) || <8; t=1,---,m. 
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We write Aj; for h=1,--+,m and $(A)~ for the nXn 


™m 
matrix whose i,j entry is the operator } (Ani*)¢(Anj). Now $(A) 2=0, 
h=1 


| 


m | 


| 
| b(Am*) -0| 


and 
| 
h—=1,:--,m. By hypothesis on ¢, the relation ¢(A4)Z—0 has a self. 


adjoint 8 cover with distortion in [k,K]. Let y be a representation of 2 by f 
operators on the Hilbert space Mf’ and let 2’ = (2,’,- - -,2n’) be a vector such f 
that <z’> has spectrum in [k,K] and || y(A)2z’ || <8*/n4K where (A) is 


the n X n matrix whose 1,7 entry is }w(Ani*)W~(Ani). In particular then, 
h=1 


for 


4=1 i=1 
In fact, since <z’> has spectrum in [k,K], <2, = has spectrum 
in [k*. K?], so that 
<2’, | = || PP* || P PSK? and | | Pe | Sk 

(notation as in Definition 2). Now (¥(A) 2,2’) = 


W(Ann*)0--- 0 


W(Ani) 
0 0 
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Thus | Sw (Anja || <8; h=1,---,m. Let be the subspace of 


0, generated by 2,’,- +, and let y,’,- - yn’ be an orthonormal basis for U’. 
Denote by S the linear transformation determined by Sy—2. We assert 


that SS* = <7’, Z’> = > ( ,2/)a/. Indeed, (S*x/, y/) = (a7, = z/'), 


nm 


so that S*x’ => (2/,2/)y/ and 
j=1 
, ud , , , , , , 
SS*z/ = > Sy/ = > (al, = <2’, 
j=1 g=1 
Thus, since 2,’,- - -, 2,’ span VU’, SS* = <2’, as asserted. It follows that 


SS* has its spectrum in [k*, K*] from which, S-**S-? has its spectrum in 
[1/K*,1/k?], so that | S|| and || 8?||=1/k. Let S*y/ ayy, so 
i=1 


that the matrix of the transformation S* relative to orthonormal basis {y;’} 


n 

ff is (a;). In define y/’ => We set up a unitary transformation 
4=1 

) 

» between © and ©’ by means of the map 2;—y;. Under this map, we see 


that y/’—> S*y/, so that 


| (x ® S*y/, [> Bin’ yn’ | [> Bin’ yn’ | [2 Binyn’ | @ [2 Binyn’ | 


=| > (S*y/, Bin’yn’) (= Bi’yn’, S*y/) 
j=l h=1 h=1 


nf > (S*y/, Bagn’) (> Binyr’, S*y/) | 
=| >> Bin’) | >> ) | 


<| Bins’ — Bin’an’) || [| ¥ (Ui) Ba’ 1] 


< || Su(Aa)ay’ || (2nBK) 
h=1 


where 6 = max {| Bu |,| Bu’ |; t—1,---,m; h=1,- - -,n}. 


In connection with the above inequality, note that we have proved that 
|z/ || SK, and that y(U,) is a unitary operator on #’ since y is a * repre- 
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sentation (with y(J) We now specify the choice of 8 as 1/2nKr 
(all the constants that appear in this choice were determined before the : 
introduction of 8). Our inequality becomes then: 


j=1 


We write y;(r) for y;/” to indicate the dependence of the yj’ upon r. Observe 
that y;(7r) lies in the sphere of radius K and outside the sphere of radius f, 
center at 0, in U since y;(r) is the image of S*y; under our unitary map 
between and (and || y/ || =1, || S* K, || S**||S1/k). By com. 
pactness, one can choose a subsequence {7,} of 7’s such that “s Yj(Tr) =4; 


g=ml,---,n. Clearly 


(Sz Yi yi) =0; t==1,---,m, 
j=1 


and the z; lie between the spheres of radii k and K with center at 0 in %. 


We consider the norm || ||’ induced on U by means of the definition: 
(|| zl’)? = @zr) => | (2,2) |?. 
j=l j=l 


We have just proved that 2B: is orthogonal to E€® so that || ¢(U)y|' 


=||y||’ if both ¢(U)y y are in VU. Thus, if d(U)y and 
= ¢(VU*)¢(U)y are in U then || d6(U)y ||’ = || 6(V)y ||’. For x an arbitrary 
vector in ©, we have (|| 2 |’)? = 


(> @ @ x) = lim Sn*yj(h) @ Sn*y/(h), @ x’(h)) 
j=1 h j=1 


lim |(y/(h), Lim | Sya(h) 


where y;(h) are the vectors corresponding to yj in the foregoing discussion 
(with 7, now replacing r) S, is the S of that discussion and 2’(h) is the f 
image of x under the unitary map between UV and WU,’ of the present dis- 
cussion. Now 


lim || S,2’(h) ||? Slim K? || 2’(h) |? = lim K? || |? = K? | 
h h h 


=k 


lim || Spa’ (h) ||? = lim || 2’(h) ||? = lim |] 
h h h 


so that k||z| S| zc}. 
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If we take as partition classes in # the sets {6(U)zx: U a unitary operator 
in %}, we arive at a situation satisfying the hypotheses of Lemma 3, so that 
9{ admits a Hilbert space norm in which ¢(U) is unitary for U a unitary 
operator in Y (this norm equivalent to the original norm with constants 
k, K). Thus, as at the end of the proof of Theorem 1, we can find an operator 
P with P-'¢(U)P unitary for each unitary operator U in %f and || P ||, || P-* || 
do not exceed max (K,1/k). It now follows, as in the proof of Theorem 7, 
that the representation A— P-'¢(A)P is a * representation of %. Writing 
the polar decomposition HU, U unitary, H=(PP*)4 for P, we have 
A—H"4(A)H is a * representation of 2% with H positive and having 
spectrum in [k, K]. 


3. Concluding remarks. The discussion preceding Theorem 8 and 
Definition 3, concerning the approximate duplication of relations draws very 
heavily upon the fact that the initial algebra is a C*-algebra (in particular, 
is uniformly closed) for the fact that an invertible operator in the algebra 
has its inverse in the algebra. On the other hand, Definition 3 and Theorem 8 
apply as they stand to self-adjoint (not necessarily closed) algebras (although 
they are not stated this way). It follows immediately from this that: 


CoROLLARY 9. A representation of a group by bounded operators on a 
Hilbert space is similar to a unitary representation if and only if the extension 
of this representation to the (finite, translation) group algebra (acting on L, 
of the group) has a self-adjoint cover. 


Despite the applicability of Definition 3 and Theorem 8 to self-adjoint 
algebras which are not uniformly closed, it should not be felt that the general 
conjecture about operator algebras has application to the non-closed, self- 
adjoint algebras. That is, examples are easily constructed of algebras which 
are not similar to self-adjoint algebras but are algebraically isomorphic to 
non-closed, self-adjoint algebras (not the continuous image, of course). In 
fact, let 2,22,- - - be a sequence of linearly independent unit vectors which 
tend (strongly) to z and which span the Hilbert space 9. Let M be the 
algebra of bounded operators A on &# which have the form Az;— «2; for 
some sequence {a} of complex numbers, and let 3 be the set of sequences 
which arise in this manner (3 contains all sequences which have only a finite 
number of non-zero terms). Let y;,¥2,: - - be an orthonormal basis for # 
and let 9’ be the algebra of operators B of the form By; = ay; where {a;} is 
in 3. Then Y’ is a self-adjoint algebra containing J, for B*y;— ay, and 
{a} is in d if and only if {a;} is in 3 (the zs being so chosen that the 
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transformation + +--+ is bounded). More- 

over the map of onto YW’ where and By, = ay; is an 

algebraic isomorphism (which is continuous, since || A || =sup|a|— || B |). 
i 


For each invertible operator P and each operator A in YU, the operator PAP 
has P-1z; as eigenvectors and these converge to Px. Now the algebra 2% is 
commutative (as is P*9%P) so that, if P*MP is self-adjoint then P-*AP is 
normal for each A in Mf. Given ij we can easily find a sequence {a} in 3 
with a; (let a4, =1, a, for p71). Let A be the operator in with 
sequence {a}. If is normal then and P-*z; are orthogonal. 
Thus if PMP is self-adjoint it follows that P-12;,, 1—1,2,--- is a set of 
mutually orthogonal vectors, which we have just seen cannot be the case. 

We commented briefly, in the introduction, on the topological difficulty 
present in the infinite-dimensional case concerning the geometrical interpre- 
tation of semi-simplicity. By making suitable corrections for this difficulty, 
one arrives at a geometrical condition which might suffice for an algebra of 
operators to be similar to a self-adjoint algebra of operators. The conjecture 
obtained is quite natural in that it corrects for all the immediately visible 
difficulties which occur in passing from the finite to the infinite-dimensional 
case. For the moment, we specifically avoid describing the topology in which 
the operator algebra in question is closed. 


Let X be an algebra of operators on a Hilbert space with the property 
that there exists a positive § such that if VU ws a closed subspace (setwise) 
invariant under the operators of MX then there exists a complementary closed 
invariant subspace W (i.e., U+W is the whole space and UN W= (0)) 
which makes an angle greater than § with VU. Is M similar to a self-adjoint 
algebra? 


Note that since the angle between W? and VU is assumed to be positive their 
linear sum is closed. Let us assume that the answer to this question is yes 
(with any closure assumption on 2%) and that A is a bounded operator on 
the Hilbert space 9 with no closed invariant subspaces other than (0) and #. 
Let be the (commutative) algebra generated by A and the identity operator 
(the closure taken in the appropriate topology). Since 2% has no closed 
non-trivial invariant subspaces, the hypothesis is vacuously satisfied and 
there exists an operator P such that P%P is self-adjoint. Since P*2P 
is commutative, it consists of normal operators. In particular PAP is 
normal and has an abundance of non-trivial, closed, invariant subspaces. 
If U is such a subspace then PV is non-trivial, closed and invariant under 
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A—a contradiction. Again, if Y& is an irreducible algebra of operators then 
the hypothesis are trivially satisfied, and an affirmative answer to the question 
would imply that % is similar to a self-adjoint algebra. Making use of this 
remark, we can answer the question in the uniformly closed case negatively. 
Our own approach to this counter-example rested upon producing a uniformly 
closed irreducible operator algebra containing an invertible operator whose 
inverse didn’t lie in the operator algebra (note that this can’t occur in an 
algebra which is similar to a C*-algebra). A much more cogent device was 
suggested to us by I. Kaplansky. Using the completely continuous operators 
as a basic irreducible set of operators, build a closed operator algebra over it 
whose quotient by the completely continuous operators is a (finite-dimen- 
sional), non-semi-simple algebra. The larger algebra is not even the iso- 
morphic image of a C*-algebra, for a quotient algebra of a C*-algebra is again 
a O*-algebra [8] and therefore semi-simple. A concrete example is obtained 
by taking as our algebra the algebra generated by the completely continuous 
operators, the identity operator, and a nilpotent operator of index two (say a 
partial isometry between an infinite-dimensional subspace and its orthogonal 
complement). 

In conclusion, we note the simple fact that a representation of a group 
by uniformly bounded operators each of which is normal is itself a unitary 
representation. In fact, an invertible normal operator all of whose powers 
form a set which is uniformly bounded in norm must have its spectrum on 
the unit circle and is therefore unitary. 


Added in proof (June 1, 1955): In a recent note, (Proceedings of the 
National Academy of Sciences, vol. 41 (1955)) F. Mautner and L. Ehrenpreis 
announce that the group question has a negative answer, i.e., they produce a 
group and a bounded representation of it which is not similar to a unitary 


representation. Presumably, then, the “distortion continuity” condition of 


Theorem 8 cannot be removed. Restricting attention to relations involving n 
or fewer vectors, we can discuss representations satisfying an “n-distortion 
continuity ” condition—the boundedness of a group representation (or con- 
tinuity of a C’*-algebra representation) amounts to “1-distortion continuity.” 
We feel that there are groups and representations of them which have n but 
not 1 distortion continuity. 
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